March 20, 1961 


X3-2 Sub- Committee 
on 

Character Sets & Data Formats 
of the 

American Standards Association 
X3 Sectional Committee 
Sponsored by 
O.E.M.I. 

MINUTES 

4b MEETING OF MARCH 8th & 9th, 1961 


SUMMARY OF MINUTES 

A large portion of the meeting was devoted to discussion and study of the problems 
associated with a standard six level, 64 character set. Capt. Luebbert gave a 
lecture outlining the results of his study of this subject. These deliberations 
resulted in the identification of two proposed character sets which are to be 
studied by the membership with the aim that a recommended character set can be 
identified at the April meeting. 

Proposed standards for perforated paper tape have been submitted to the Committee, 
by other organizations. It was decided to table discussion on these items until 
that time at which the committee undertakes a study of formats and media. 


C. E. Macon 
Secretary 

CEM: aks 




1.3 Observers Present 


1.4 Resignation 



R. W. Berner 
Robert Gryb 
W. M. Kahn 
W. F. Luebbert 


Mr. John H. Craig has 
voluntarily resigned 
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Membership Modification 

Notification of Automatic Resignation 

Letters had been sent to Mr. Weber and Mr. Schoenberg informing 
them of the Membership and Participation Rules adopted at the 
January 11th and 12th meeting of the committee. 

Mr. Weber's letter of 2/28/61 was read in which he acknowledged 
receipt of the letter from the c~~:iittee and, as a result of 
studying the rules, submitted hi~ resignation. Mr. Weber requested 
that he be allowed to attend as an observer. The committee ap- 
proved Mr. Weber's resignation and request for observer status. 

Since Mr. Schoenberg had not replied to the original letter the 
secretary was requested to inform him of the Membership and Parti- 
cipation Rules by registered mail. 

User List 

Mr. Auwaerter will submit to the chairman a list of the principle 
communication user groups by the next meeting. 

There are data processing user groups of a general nature in the 
process of formation (AFIPS & JUG). The chairman will investigate 
this matter and report at the next meeting. 

It was noted that ATA had been invited to join the committee but 
that no reply had been received as yet. 


2. Minutes 

The Minutes of the February 9th and 10th meeting were read and approved. 


3. Documents 


3.1 Documents Distributed 

E.X.A. Tentative Standards Proposal for Eleven Sixteenths Inch Wide 
Five- Track Perforated Paper Tape. 

3 1,2 AIEE Standard for Perforated Tape (Preliminary). 


3.1.3 



3.1.4 


Report "On the Design of Extended Character Sets" 
by R. W. Berner 

Report "Design of an Improved Transmission/Data Processing Code" 
by Berner, Smith & Williams 

NOTE: Replacement copies of the documents distributed to date 
can be obtained from the Secretary of the Committee. 
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The Secretary was directed to obtain for the Committee copies of the 
report: "Information Handling and Processing In Large Communication 

Systems" by W.F. Luebbert* 

Perforated Paper Tape Standards 

The subject of the EiA Standards Proposal on One- Inch Perforated Paper Tape 
distributed at the February meeting was brought forward. The documents 
listed in 3.1.1 and 3.1.2 were distributed. After some discussion the 
following motion was approved by the Committee: • ' 

A. The documents on Perforated Paper Tape received by the Committee 
be acknowledged as input information toward that time when the 
Committee shall consider the subject of Perforated Paper Tape. 

B. The subject of standards for perforated paper tape be tabled until 
that point at which the Committee considers media and format. 

Six Bit Code Discussion 

A six bit code was presented (refer to document under 3.1.4 above). The 
general subject was discussed at some length and Capt. Leubbert presented 
a lecture based on his studies of the problem. In addition to the points 
listed under Section 6 of the February Minutes the following points were 
brought forward: 

From a communication's point of view it is desirable that codes associ- 
ated with "double purpose" keys differ in only one binary position for 
a given key. 

Summary of Discussion on Collating Sequence. 

The collating sequence of a code is of primary importance in sorting 
and forms the basis for the structure of files. In sorting and 
establishing files all or only a portion of the available character 
set may be used depending on the Sort Key adopted. 

Sort Keys 

The basic keys believed to be in use are: 

a) Numeric 

b) Alpha 

c) Alpha 6c Numeric 

c) Alpha & Numeric 6c Special 

If either a numeric only or an alpha only sort key is used then the 
sequence of the numeric sub- set or the sequence of the alpha sub- set 
is of prime importance, but the relationship of the numeric sub-set 
to the alpha is not. In addition, if an alpha-numeric sort key is 
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used and the fields arc defined and purely alpha or numeric the 
same statement holds. 

In short, the exact collating sequence of the total character set 
is of importance if and only if the sort key includes mixed fields 
of the total character set. 

5.2.4 Importance of Collating Sequence 

• • 

It was noted that from a purely logical point of view virtually any 
well ordered and defined sequence is satisfactory if new files are 
being sorted and established. In the case of established files, 
however, changing the sort key could require a considerable re- 
sorting task which would be costly. It is believed that for one 
large group of data processing users that about 60% of this group 
are using either pure alpha or pure numeric keys and hence will not 
be effected unless the normal sequence of these sub-sets is dis- 
turbed. Of the remaining 40% which use alpha and numeric and limited 
specials, the majority employ a special, alpha, numeric sequence. 

5.3 General Six Bit Set 

In considering a General Six Level Set for communications and data 
processing, it seems desirable the five level set derived from collaps- 
£ ing the six level set provide for something less than 64 characters. 

Since 6 special function characters are required (i.e., Master space, 
blank, carriage retux*n, line feed, escape and delete) and, as presently 
envisioned, the corresponding 6 characters In the six level code will 
be lost in the "collapsing” process, it appears that 52 characters are 
available for general six level interchange of information. 

6. First Draft Proposals - Homework 

Two six level codes were identified (see Table 1 and 2) which seemed to 
accomplish most of the objectives. These are to be studied so that a 
single proposed six level code can be identified at the next meeting. 

7.. Communications 

■ In his letter of March 3, 1961, Mr. Liggett acknowledged establishment of 
Mr. Robert J. Rossheim as the coordinator between X3-2 and X3-6. 



8. Next Meeting 

The next meeting will be held April 12th and 13th. The exact time and 
place will be stipulated in the forthcoming Agenda. 

6-S-M- - 

C. E. Macon 
Secretary 


CEM: aks 
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First Proposal for 
Standard Code Set by ASA X3-2 


Octal 

Character 

Octal 

Character 

Octal 

Character 

Octal 

Character 

00 

Master Space 

20 

0 

40 

(to be assigned) 

60 

M 

01 

blank 

21 

1 

41 

(to be assigned) 

61 

N 

02 

Carriage Return 

22 

2 

42 

(to be assigned) 

62 

O 

03 

Line Feed 

23 

# 3 

43 

(to be assigned) 

63 

P 

04 

# 

24 

4 

44 

A 

64 

Q 

05 

% 

25 

5 

45 

B 

65 

R 

06 


26 

6 

46 

C 

66 

S 

07 


27 

7 

47 

D 

67 

T 

10 

( 

30 

8 

50 

E 

70 

U 

11 

) 

31 

9 

51 

F 

71 

V 

12 

/ 

32 

$ 

52 

G 

72 

W 

13 

’ 

33 

* 

53 

H 

73 

X 

14 


34 

(to be assigned) 

54 

I 

74 

Y 

15 

+ 

35 

(to be assigned) 

55 

J 

75 

Z 

16 


36 

Upper Case 

56 

K 

76 

Escape 

17 

> 

37 

Lower Case 

57 

L 

77 

Delete 


The following points are features of this code: 

1) The Alphabetics are in a 5-bit subset 

2) The format effectors (MS, Blank, CR, and LF) collate first and are easy to logically recognize 

3) Master space precedes all characters so that it is sure to collate first 

4) All punctuation marks collate before numerics and alphabetics 

5) Two symbols ($ and *) unique to the U.S. occupy codes 32 and 33 following the numeric 0-9 so that 

compatibility with a British duo-decimal code is achieved. -~I~J 

6) By collapsing the low order six-bit subset into a four-bit subset, codes 20-33 and §4437 become a 
convenient numeric subset 

7) If a numeric 4-bit subset or an alphanumeric 5-bit subset is desired, certain pending positions in each 
set are available for control or format effectors (i.e. the function assigned octal 00-03) can be 
assigned to and the similarity between octal 36-37 and octal 76-77 allow them to be interchangeable. 

8) Delete is all ones (octal 77) to allow paper tape punch-out 

9) Six code positions are available for special applications 

Mr. Tom McNamara, the Honeywell delegate to the EXA TR 27.6 and the alternate to X3-2, also 
attended. 





CERTIFICATION OF ALGORITHM 20 
REAL EXPONENTIAL INTEGRAL (S. Peavy, Comm. 
ACM, Oct. 1900) 

William J. Alexander* and Henry C. Thacker, Jr.* 
Argonne National Laboratory, Argonne, Illinois 

Expint. (x) was programmed for the LGP-30 computer, using 
both a 7S floating-point compiler (ACT III) and an 8S floating- 
point interpretive code (24.2). Constants given to more than 7S 
i or to 88 for the 24.2 program) were rounded to 7S (or 8S). 

After (‘hanging the constant .(X)551990S to .055199158, both pro- 
grams gave acceptable accuracy over the range tested. 

The sS (24.2) program was compared with the 9D values given 
for — K , ( — x ) in Mathematical Tables Project, Table. s* of Sine, 
Cosine, and Exponential Integrals, Volume II (1940/ for the 
<et of values x = 0.1 (0.1 )1 .0(1.0)10.0. The largest discrepancy found 
was — Hi X 10~ s for x = 0.1. For x greater than 1, all values tested 
were good to SS. 

For computing real values of the exponential integral, this 
algorithm is much faster than EKZ ; Algorithm 13). For x < 1, 
the ratio of speeds was of the* order of 20. 

Work supported by the l .S. Atomic Energy Commission. 


CERTIFICATION OF ALGORITHM 43 

CIlOl’T (L (Henry C. Tluicher, .Jr., Comm. ACM, 1900) 

Henry C. Tiiachek, Jr.* 

Argonne Xational Laboratory, Argonne, Illinois 

CHOL'T II was coded bv hand for the Royal Precision LGP-30 
computer, using a 28-bit muntisa floating point interpretive 
system (24.2 modified). 

The program was tested against the linear system: 
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Standards 


Further Survey of Punched Card Code 


H. AIcG. Ross, Ferranti Ltd., London 


% 

V 

i 


det = —1045.4499. All elements of Ab — y were less than 10~ 7 in 
magnitude. Identical results were obtained with the same b, 
and repeat true. With the same b and the last row vector of A 
replaced by (19.1927, 33.4409, 25.1298, 5.2811), i.e. A 4, j = A 1 j, 
+ 2A 2, j — 3A 3, j, the results were: 
det = 0.10924352 X 10" ! , 

y = (0.29214425 X 10 8 , -0.12131172 X 10 s , 0.72411923 X 10 7 , 
-0.51018392 X 10 7 ) 

Failure to recognize this singular matrix is due to roundoff, either 
in the data input or in the calculation. 


The valuable “Survey of Punched Card Codes” p 
pared by Smith and Williams {Comm. ACM 3, Dec. 196 
63S) unfortunately omits the card codes of Europea 
equipment, other than IBM. These are presented in tl 
table on page 181. This information has been extracte 
from a Ferranti publication, “Collected Information a 
Punched Card Codes” (List CS 266) and has been set o 
in much the same way as the table by Smith and Willia 

A valuable step forward has been made by the Britis 
Standards Institution in publishing Standard 3174 
“Alpha-Numeric Punching Codes for Data Processinj 
Cards”. As well as decimal numbers and letters, t 
Standard also gives single-column punching codes fo 
pence, for shillings, for months, and for days within thi 
month, etc. 

In the table, the card rows are identified by 12, 11, J 
1, ••• , 9, from top to bottom; it should be noted, how) 
ever, that modern practice in Britain prefers the top ro 
to be numbered 10. Where nothing is punched in a zone 6: 
position the symbol b is used. The abbreviation Sp is usee 
for the space obtained in a printer from an entirely Wan! 
column. Letter 0 is shown with a dot in it, to distinguish 
it from zero. 'M 


The table gives examples of the treatment of days 
months, pence, etc., particularly when a single column 
used, often with two holes in it. However, even within on< 
type of code, 10 and 11 may be reversed. 

The “old Hollerith” 4-zone code, which is widely used 
is designed to “cycle” within the zones; this brings thj 
letters into alphabetical order. j 

The asterisks refer to the Bull codes in which rows 7, 8 
and 9 are used to punch the zones, in place of 12, 11, an 
0 respectively. Another special optional feature is thi 
“mechanical zero” punched in row 12; this is for left 
hand zeros, and is treated as zero by the accountin; 
machine but does not print. ' 


I 


* Work supported by the U.S. Atomic Energy Commission. 
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Survey of Punched Card Codes — European Equipment 
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Bull* 300 Series Sp 
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Subcommittee X3-2 Character Sets and Data Formats of ASA X3 Sectional 
Committee on Data Processing Standards 


MINUTES 

Sixth Meeting, April 12th and 13th, 1961 
ASA Headquarters, New York, New York 


SUMMARY 

The relationship of the six bit (64 character) coded set to larger 
(7 and 8 bit) and smaller (4 and 5 bit) sets must be determined. This re- 
lationship must be delineated and Included in the recommended standard. 
Accordingly the majority of the meeting was devoted to exploring the content 
of the various sets as a function of set size and determining the conver- 
sional modes between these sets. An interesting structure for a family of 
coded character sets spanning the range from a 4 bit set through an 8 bit 
set was identified. These considerations will be pursued at the next meet- 
ing scheduled for April 26th and 27th in New York. 
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X3- 2 Sub-Committee 
on 

tacter Sots U Data Formats 
' of the 
lean Standards Association 
X3 Sectional Committee 
Sponsored by ' 

O.E.M.I. 


April 19, 1961 


MEETING OF APRIL 12th AND 13th 


MEMBERSHIP £• ATTENDANCE 

Members Prosont * 

John Auwaortor * Teletype Corp. 
Leon Bloom * NCR 

Theodore H. Bonn - Remington Rand 
L. L, Griffin - Navy Department 
I. C. Liggett - IBM 1 

C. E. Macon - Burroughs Corp. 


Members Absent 

H. Kleinberg - RCA 

Lawrence J. Schoenberg - Monroe 
Calculating Machine 


Observers Present 
Robert Gryb - AT&T Company 


toy Reach - Minneapolis Honeywell Capt. W. Luehbert - West Point 

> H. J. Smith, Jr. - IBM 
H. Tholatrup - Friden Inc. 

Allen L. Whitman - Bell Telephone Lab. 

Changes in Membership 

Mr. Robert Gryb was elected to membership by unanimous vote. 

Capt. W. J. Luehbert was elected to membership by unanimous vote. 

Mr. Schoenberg was contacted^by^phon^ln^ddition^o the^^ cUri£y - , 

ing V his 8 in tent C with° respect to the Cosaaittee activities. 

REVIEW of minute distribution 

The subject of Minute rece^e'minutes. 



MEETING 

3. ■ 

3.1 

3.1.1 

3.1.2 

3.2 
3.2.1 


minutes 

OF APRIL 12th 6 13th April 19, 1961 


CORRESPONDENCE 

k 

Sent 

Letter to Mr. Schoenberg requesting that he clarify his status 
with respect to the Committee. 

Letter to Stanford University requesting 20 copies of 
W. J. Luebbert's Technical Report entitled "Information Hand- 
ling & Processing in Large Communication Systems". 

r 

Received 

Letter - Bright to Liggett and members: attached "Further 
Survey of Punched Card Codes" H. McG. Ross, Ferranti Ltd., 
London. 


3.2.2 Letter - Bright - Format for Cover Sheets of Minutes of Meetings. 

3 . 2.3 From Bright - Revision of Fact Sheet on the Data Processing Group. 

3 . 2.4 In reply to an earlier inquiry, Mr. Ross of Ferranti sent a copy 
of his report. "Considerations in Choosing a Character Code for 
Computers and Punched Tapes". 


PING 
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A.F.I.P.S. 

The Chairman will continue hia investigations in this area. 

£5,3 Communications 

Mr. Auwaerter submitted a list of four of the principle user groups 
in the communication area. (See Attachment A). 

Mr. Howard reported briefly on his visit to the* recent Air Transport 
Association meeting. Their work at this point is not directly 
related to the present considerations of X3-2. 

X3 MEETING 

The Chairman gave a brief report on the recent X3 Sectional Committee 
meeting. A copy of the Status Report which he presented at this meeting 
is attached. (See Attachment B), 


MINUTES 

The Minutes of the X3-2 meeting of March 8 th and 9th, 1961 were approved 


4. DOCUMENTS DISTRIBUTED 

4 .! ■ Draft report by Smith on the philosphy of structuring a family of 

coded character seta. 


8, DISCUSSION OF CODED CHARACTER SET FAMILIES 

i- • 

8.1 Mr. Smith presented a discussion of the draft report on this subject 

(reference 4.1 above) 


Draft of the "Report on EIA Survey of Information Processing". 
"An 8 Bit Code Set Structure" by J. F. Auwaerter. 


5. USER GROUPS 

5.1 JUG 

The Chairman reported that he had contacted Mr, Harry Cantrell, 
President of J.U.G. Mr. Cantrell indicated that J.U.G. would like 
to cooperate and would be willing to. 

a) Supply a representative to X3-2 

b) Sot up separate task groups to work in certain areas and/or 

c) Supply information on request from X3-2 

X3-2 will consider which of these courses will be of the most value 
to both groups. ’ 



In the open discussion which followed tho considerations which must 
be included in determining the structure, content and interrelation- 
ships involved in such a family of sots wore examined in some detail 
It was determined that tho ochemo presented in tho draft report was 
very sound from sevoral points of viow. Somo of tho points brought 
out in the discussion weru: 

Collapsing a Six Bit Code into a Five Bit Code 

Some light Investigations which hnvo been made on this subject 
indicate that thoro is littlo intoroot in this feature unless, 
of course, tho collapse is to tho 5 Bit Bandot Code (which is 
highly unlikely if a elmplo schomo is invoked). 

Double Purpose Keys 

With regard to the code difference between characters carried 
on "double purpose keys" it was noted that a 2 bit (as opposed 





Location of alphabet 

Locating tho alphabot in oitlior tho first two or last 
two quadrants of tho 6 bit aot has the following 
advantages: 

a) Tho code ia easier to handle in "primitive" 

(i.e. simple) equipment, (o.g. key not equipment, 
paper tape readers and punches, small printers and 
the like). 

b) The code ia more easily expanded to larger seta or 
contracted to smaller seta. 

The disadvantage is the effect noted in 8.2. 3. 2, above. 

7 Bit Set 

The point was brought forward that it might be desirable to 
standardize on a seven bit set instead of a six bit set as the 
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Standard for in*o»»£o« ^7^ 
data processing. This sc b0 given some study Dy 

fhfmUlrf - wiU Considered at the next meeting. 


mm CRITERIA ^ 6A_CHARACTERjET 


will be discussed at the nex 


meeting 


~ rr.r, 


Rpg ular Mee tij 


r T i me: 9:30 A.M. 4/26/61 
Engineers Club (3rd Floor) 
32 West 40th Street 
New York, N. 


May 8th (Hon.) * U* 


informa l Meet ini 


^r !; 8 at tending” the TstTn Joint^puter 

Conference. 


a lace will be announced at the April 26th & 27th 
Exact time and place will 
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CTATI1S REPORT 

ASA X3^2 SUBCOgllTTEE 


t ly act ive on the X3-2 Subcommittee 

following people are pres 


I. C. Liggett, Chairman 

C. E. Macon, Secreta y 

John Auwacrter 

Loon Bloom 

T. 11. Bonn 

L. L. Griffin 

H. Kleinberg 

R. W. Roach 

L. J. Schoonborg 

H, J. Smith, Jt. 

H, Tholstrup 

A. L. Whitman 


IBM 

Burroughs 

Teletype 

NCR . 

Remington Rand 

UOD (Navy Dept.) 

RCA 

M-H 

Monroe 

IBM 

Frldon 
Boll Labs. 


A. L. Whitman 8 

- — — - — —“tr 


J. H. Craig 
E. A. Emerson 
r. W. Green 
W. S. Nlviaon 
E. C. Olofson 
B. B. Weber 


AT&T 

Standard Register 
Re cord ak 

Royal McBee 
A-M 


Our original objectives were representation'an^then iterate^over^theae 

’ design S obiectlves: ' 



>■ i h “ 0 U Q W b0 8lmilar to existing codas and/or a union of existing 


i/ Contain same number of bits per charactor. 

3) s u ^bo u i s c :a e ;o 8 f "bie" habec ‘ numbera ‘ contro1 - and as 

4) Logical structure for expansion and contraction. 

Provide simple means of categorizing groups of characters. 

6) Numbers should bo BCD (Binary Coded Decimal). 

7) Maintain historic collating sequence. 

Simple sorting procedure to maintain collating sequence. 

Contiguous alphabet and contiguous numbers. 

10) Expansion for foreign alphabet, and allow for 10 • 11 for Britain. 

11) Suitable for data transmission. 

12) Must contain data format control and equipment control characters. 

U) basis 3 (ESCAPE) 3 meana ° £ large alternate 8eCs on an exception 


pr0posal haa about an 807. chance of gaining acceptance by 
M y 1961 within our committee. We believe it represents the best solution 
to the criteria mentioned with the least compromise. 

This character set is thought of as a data processing-communication code, 
we believe it is organizationally sound, usable in 4, 5, 6, 7, and 8 bit 
versions and has a probable life of 15 years. Our immediate schedule 
contains the following: 

1) . Reach agreement on a final proposal. 

2) Work with the Fieldata people and determine their reaction to this 
new code. 


1. C. Liggett 


3/27/61 


April 19, 1961 


ATTACHMENT C 

BASIC CRITERIA FOR 64 CHARACTER SET 


* Cari wcr ® ^Pi-led by an informal task group working the evening 
Ws i 1 Refarenco is made to Fieldata, Section 3.6 Criteria for Alpha- 
ric Code Selection and the criteria set forth in the EIA Tentative Standards 
posal for Basic Character Sot Coda of May 25, 1960. I The 10 points set forth in 
eldata are included and all but 2 of the points of the EIA proposal'. The EIA 
infs not included are: .. 

H) The numerals should be contiguous with the alphabet. 

J) The numerics should follow the alphabet. 


The Set is to facilitate the Interchange of information between 
Systems and between Systems and Associated Equipment. 

The set is to include: . 

a) The alphabet (Upper Case) 

b) The numerals 0-9 

c) Certain special characters 

d) The minimum control codes required to 
Implement (1) above 


ELA-E-C 3) 
FD-2 


The codes for all symbols of the set are to include the same 
number of bits (6).' 


EIA-E 4) 


EIA-D 5) 
FD-3 


The code set is to be structured such that expansion to larger 
sets or contraction to smaller sets can be achieved in a 
simple and logical manner. 


The basic set and code shall not contain error control. 


EIA-B&A 6) The alphabetic sub set shall be contiguous. 



April 19, 1961 


The numeric sub see shall bo contiguous 


The cet shall be so organized and coded that the 
possible test shall be adequate to distinguish and identify 
the basic alphabetic, numeric and symbolic sub sets. 


The numeric sub set shall be so coded that the least four 
significant bit shall be the binary coded decimal form of 
the decimal digit which the code represents. 


The alphabetic sub set shall be so coded that the code for A 
will be the smallest binary number and the code for Z the 
lamest binary number within the alphabetic code area. (Of 
course the cSe and alphabet are monotonically increasing 
from A — ► Z) • 


EIA-I ID The code set shall contain an "Escape" character and code 


12) . If possible, the set will include tho COBOL character set 


The code 000000 will be reserved for a space or blank 
function. 


14) The word separator must be '.'low 1 


a string for interpretation. 


The "binary sequence and the collating sequence of the character 
set bo identical. 


The all ones code will bo reserved for a delete function, 


•. 17) 


0 0 0 0 


AU 8 KTT COPE SET S TRUCTURE 

10 11 

■ 00 01°10 - U oo-oTTo-n oo oi iu IT oo oi io u 

MS 

B S D A Z 7 7 C C 7 7 a Z 7 7 C C 


B7 & Bg 
B5 & B4 


Notation: -MS is communications BLANK 

Fieldata Master Space - all 0 s 

6 is data BLANK, communications SPACE 

S is Symbols 
D is Digits 

A - Z are upper case Alphas 
C is Control 

a - z are lower case Alphas 

Characteristics: 

!. Provides a 4 bit subset in column B 7 B 6 B 5 B 4 — 0001 

2 . strips to a 5 bit subset in columns B 7 B 6 B 5 B 4 — 0000 and 0001 

Master Space and Blank (Space) 
collate low, as special graphics 

3. Strips to a 6 bit subset in columns B 7 B 6 — 00 

4. The 4, 5 and 6 bit subsets have no control characters 

5. Provides a 7 bit set in B 7 B 3 — 00 and 01 

6 . Eight bit set provides lower case alphas and more controls 

7. Controls all have B5B5 •-11 

8. Alphas all have BgB5 ^”01 


.... ..,1... “* ■ ii “ “ >u *’ 

,0. for •"* ■“ “ 


J. F. AUWAERTER 
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M E M 0 R A N D U M REPO R T 

SIGRA/SL--NPE 14 A P ril 1961 

MEMORANDUM FOR: Director, Data Processing Facilities Division 

SUBJECT: Some Possible Future Changes That T $ill Not Affect Present 

Systems and Equipments in the FIELBATA Keyboard and Code, 
and the Effects of These Changes 


PURPOSE: The purpose of this memorandum is to state some changes 

that may be desirable in the FIELDATA Keyboard and Code. The desirability 
of these changes will be discussed along with the estimated effects on the 
FIELDATA system. It is hoped that the suggestions and comments received 
from persons involved in the FIELDATA program will give rise to a final 
report of recommended changes. 


BACKGROUND: The Minutes of the Sixteenth Meeting of the Third Review 

of the Committee on Military Communication System Technical Standards, dated 
30 January 1961, contained a proposed modification to the FIELDATA code by 
the Teletype Corporation (Bell System) which they propose using for a new 
teletypewriter service under consideration by the Bell System. 


Since the ADPS Keyboard has certain inconsistencies with com- 
munications oriented devices it would be desirable to modify ^ 
code and Keyboard if such changes were tolerable. It should be emphasized 
that these changes do not affect present equipments, systems and items 
under development. 


DISCUSSION OF CHANGES: The following will sot be changed: 


Master Space G 
Upper Case R 
Lower Case I 
Line Feed J 
Carriage Return K 


Space 

A 

B 

C 

D 

E 

F 



M 

N 

0 

P 

Q 

R 




0 

1 

2 

3 

4 

5 

6 

7 

8 
9 


i 


/ 


Special 



The following are possible changes: 
- changed to I 


St> 

1! II 

& 

1 

11 t! 

, 


11 1! 

— 

delete 

1) 11 

idle 

idle 

It t! 

delete 

> 

II 1! 

# 


ri n 

# 

< 

1! » 

% 

Blank/idle 

If it 

blank 


It 11 

bell 

. * 

II It 

* 

t£) 




'•—I' — 

The above changes were motivated by: 

(a) In attempt to arrange the keys so that they would be more 
in linb with commercial keyboards. 

(b) An attempt to make FIELD AT A alphanumeric compatible with 
a 6 level commercial code ( ATT ) . 

(c) An attempt to replace symbols which would be rarely used 
by possible more useful symbols. 

DISCUSSION OF CHANGES : 

BLANK /IDLE to BLANK would only involve a change in name. This change 
would avoid ambiguity (since there is an idle code), and would better 
describe the code function. 

The changes would enable placingj'underline" and - ^ash»on the same 
thev now differ only in one bit with respect to each other. 
Also* since "dash" is used for "minus'* it would now be in the same case 
reference to the numerical codes. 

The exclamation mark (l) and the Bell are nov able to be placed over 
the 1 and 6. 

..Greater than- ,> , "equal to" = and ness than" <s « IM 

and replaced by #, f, and % respectively. These substitutions are 

possibly more useful. 

The © symbol was deleted and the code assigned to ,! Period". Th± s 
change enables both "comma" and "period" to each have a separate key 

( p 

V Ci <» cS? ■<£»*■ 


E 


both upper and lower cases. This is identical with commercial practice. 

The idle code combination would now represent delete, and the delete 
code combination would now represent idle* Idle would therefore now pe a 
control character, and delete would now be an alphanumeric character." The 
major benefit of this change is that the FIELDATA paper tape code can now 
be slightly modified to enable the conversion process of Standard Form to 
Paper tape Form (and vice versa) to be realized by a simple inversion of 
the parity bit. Another advantage is that the FIELDATA n delete ,! is now a 
true 6 bit code delete. 

By depressing the control code key the keyboard can be used to 
generate all 64 control characters. 

Note : Although there is no backspace key on the proposed keyboard, the 
keyboard will be capable of generating and functionally responding to the 
control code for backspace. 


Prepared by 


Division Keyboard Committee 
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MS 
UC 
LG 
LF 
GR 
■ SP 
• A 
B 
C 
D 
E 
F 
G 
R 
I 
J 
K 
L 
M 
N 
0 
P 
Q 
R 
S 
T 
1) 

V 
W 
X 

Y 
Z 



Blank 

UC 

LC 

LF 

CR 

SP 

A 

B 

C 

D 

E 

F 

G 

H 

I 

J 

K 

L 

M 





ALPHANUMERIC 



special 

Delete 



I 


• J 



PROPOSED 

CHARACTER 


BLANK 
TST ' 

TCL 

TAB 

CONTROL 

CONTROL 

it 


STANDARD CODE 


PAPER TAPE 

(ODD 

i PARITY) 


(EVEN PARITY) 

P C 

I 1 * T 1 °3 °1 ° 

0 

P C 

h h d 3 3 2 D 1 D 0 

7 6 

5... .4 3 2.1 0 


7 6 

5 i 3 2 1 Cl 

1 0 

0 0 0 0 C 0 

1 

0 0 

0 0 C 0 0 0 


0 0 0 0 
0 0 0 0 
10 0 0 
0 0 0 0 
1 coo 
10 0 0 
0 0 C 0 
0 0 0 0 
10 0 0 
10 0 0 
■0 0 0 0 
10 0 0 
0 0 0 0 
0 0 0 0 
10 0 0 
0 0 0 1 
10 0 1 
10 0 1 
0 0 0 1 
10 0 1 
0 0 0 1 
0 0 0 1 
10 0 1 
10 0 1 
0 0 0 1 
0 0 0 1 
10 0 1 
0 0 0 1 
10 0 1 
10 0 1 
0 0|0 1 


1 0 
1 1 


0 0 
0 1 


1 0 
1 1 
0 0 
0 1 
1 0 
1 1 


10 0 0 
10 0 0 
0 0 0 0 
10 0 0 

0 0 0 c 
0 0 0 0 
10 0 0 
10 0 0 
0 0 0 0 
0 0 0 0 

1 C C 0 
0 0 0 0 
10 0 0 
10 0 0 
0 0 0 0 
10 0-1 
0 0 0 1 
0 0 0 1 
10 0 1 
0 0 0 1 
1 0 0 Li 
10 0 1 
coo 1 
ooo 1 
10 0 1 
10 0 1 
ooc 1 
10 0 1 
0 0 0 1 
0 0 0 1 
IOC 1 


0 0 
0 0 
0 0 


1 0 

1 0 


0 0 
0 1 


0 0 
0 0 


0 1 

0 1 


0 0 
0 1 


0 0 
0 1 


BLANK/IDLE 

TST 

TCL 

TAB 

Control CR 
Control SPA 
" A 
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CHARACTER 

STAN 
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7 6 

IDARD CODE 
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D 
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!R 

n 

Ip 

5- 

TAPE 

Parity) 

X 1 d 3 

D 2 

2 

D 1 
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Dial 6 
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0 
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Dial 0 

Dial 1 
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1 

0 

0 

0 
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1 
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1 

0 
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0 
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0 

1 

0 

0 

1 

1 

1 

0 

0 

1 

0 

0 

1 

1 

1 

" 7 

Dial 8 

1 

0 

1 

0 

1 

0 

0 

0 

0 

0 

1 

0 

1 

0 

0 

0 

" 8 

Dial 9 

0 

0 

1 

0 

1 

0 

0 

1 

1 

0 

1 

0 

1 

0 

0 

1 

" 9 

SOC 

0 

0 

1 

0 

1 

0 

1 

0 

1 

0 

1 

0 

1 

0 

1 

0 

SOC 

SOB 

1 

0 

1 

0 

1 

0 

1 

1 

0 

0 

1 

0 

1 

0 

1 

1 

SOB 

SOD 
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c 
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0 
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420 LEXINGTON AVENUE 


NEW YORK 17, NEW YORK 


MURRAY HILL 3-6433 


DATA PROCESSING GROUP 

OFFICE EQUIPMENT MANUFACTURERS INSTITUTE 

JOHN H. HOWARD CIRECTCR 

HIRDBRT D. SRIOHT ENGINEERING DIRECTOR 


Subcommittee X3-2 on Character Sets and Data Formats of 
ASAX3 Sectional Committee on Data Processing Standards 


l^Seventjy Meeting , April 26th and 27th, 1961 
Engineers Club, New York , New York 



SUMMARY 

A report on a meeting between some of the members of X3-2 and the 
Committee on Military Communications Systems Technical Standards 
was given and discussed. The "Basic Criteria" of character sets 
were discussed and a revised set (attached) was prepared for 
committee consideration. Several methods of expanding and con- 
tracting members of a family of sets to obtain other members of the 
family were presented and examined in detail, A new approach and 
organization was presented in outline form for consideration by the 
members of the committee. 


I, C. Liggett, Chairman 




C, E. Macon, Secretary 




X. C. Liggett - IBM 

C. E. Macon - Burroughs Corporation 

Roy W. Reach - Minneapolis Honeywell 

H. J. Smith, Jr. - IBM 

H. Tholstrup - Friden Inc. 

Allen L. Whitman Bell Telephone Lab. 

John Booth (Robert Gryb'a alternate)- 
AT&T Company 

Members Absent 

Leon Bloom - NCR 

H. Klelnberg - RCA 

Capt. W.' J. Leubbert - West Point 

Lawrence J. Schoenberg - Monroe Calculating Machine 
Observers Present 

Willis DeHart - Defense Communications Agency 
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May 4, 


MINUTES 

MEETING OF APRIL 26th AND 27th 


2. MINUTES 

, -The minutes of the April 12 and 13th meeting were distributed, read 

and approved. 

3 . INFORMAL MEETING 

The informal meeting to be held concurrently with the WJCC was briefly 
discussed and agreed upon. The meetings will be held from 9 A.M. - 
5 P.M. Monday, May 8 th and Thursday, May 11th and possibly the morning 
of Wednesday, May 10th. The place of the meetings will be the Mayan 
Hotel, 3049 West 8 th Street, Los Angeles, California. 

4. MEETING REPORT 

On 4-25-61, Messrs. Auwaerter, Griffin, Liggett and Whitman met with 
the Committee on Military Communication Systems Technical Standards. 
The purpose of this meeting was to informally introduce the Committee 
to the activities of X3-2 and discuss mutual problems. The result of 
this discussion for X3-2 was a better understanding of the Military 
problem. The main point, as pertains to X3-2, is the following: 

The military, for lack of an industry standard and lack of 
definitive action to establish such a standard, developed 
their own standard, Fieldata. At this point in time they 
have a considerable amount of hardware and software under 
development which incorporates this standard. An industry 
standard which is different from the mil standard would have 
to present strong advantages over the mil standard before it 
would gain acceptance by the military. f 


BASIC CRITERIA 

The "Basic Criteria", attachment C of the Minutes for the April 12th 
and 13th meeting, were discussed and expanded. This revised document 
is included as attachment "A" for consideration by the committee at 
the next meeting. 
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Needless to say it was stipulated that the message to be transformed 
must be limited to characters contained in the smaller of the two 
sets. Consideration of characters derived from such devices as mode 
shifts, character pairs and the like was ruled out on the basis that 
the characters and codes would then not meet the criteria set forth 
for the standard. 


Two fundamental schemes were considered, 
below: 


Scheme 1 - Add a Bit 


These are briefly described 


This scheme is discussed in some detail oelov . one ,-tuoy of 
Figure 1 will indicate the possibilities and limitations of 
the scheme. Basically, Be, B^, B 7 & B 3 are successively added 
to progress from a 4 level to an 8 level code. When detailed, 
the set for each level would be fully delineated. 


"Upward" Conversion 


4 level set 


U 4 U 3 U 2 U identify the 16 characters of the 
"numeric" set. This set includes O' - 9 
and some specials. 


5 level set B^ is added, 


5 - 1 (i.e. 1 B^B-^B^) is the 4 level set. 

5*0 (i.e. 0 B 4 B 3 B 2 B 1 ) identifies the 16 
additional characters of the 5 level 
set. 

to 5 level conversion: 

For data expressed in terms of the 4 level 
set, insert a 1 in the B^ position to 
express the same data in the 5 level set. 

to 4 level conversion: 

The data must be expressed using only 
those characters available in the 4 level 
set. If this condition is met then 
"ignor" or strip off the B,. bit (which 
will be 1 in all cases) ana the data will 
be expressed in the 4 level set. 


General Case 


N level set B^ is added 


B N - 0 (i.e. 0 B ] 
set. 

B n « 1 (i.e. 1 B, 


N-l 


N " 1 


.B^) is the N-l level 
. B^) identifies the 


characters available in addition to those 
of the N-l level set. 
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NOTE: Depending on the placement of the N-l 
level sec in the N level set 0 & 1 may 
be inverted. If N-l is identified by 
Bjj ° 0 then the N-l set will be located 
in the lower half of the N set. If N-l 
is identified by B N = 1 then the N-l set 
will be located in the upper half of the 
N level set. Of course this pattern 
would be specified when the organization 
of the family of sets was decided. 


Example 1 


Take the number 1961 in 
the 4 level set 

B3 B'2 B^ 

0 0 1 

0 0 1 

1 1 0 

0 0 1 

in the 5 level set 

B 5 B 4 B 3 B 2 B 1 

1 0 0 0 0 1 

9 0 1 0 0 1 

6 0 0 1 1 0 

1 0 0 0 0 1 


b 4 

1 0 

9 1 

6 0 

1 0 


Note that by eliminating the B 5 digit the 5 level code 
becomes the 4 level code. Also by simply placing an 
"O" in the Be position the 4 level coded representation 
becomes the 0 level coded representation. 


Example 2 


Consider the identifier AX 56 
in 6 level code this, is 


A 

X 

5 

6 
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Now if we strip off the 
B & position we have 

Code Character 

B 5 B 4 B 3 B 2 B, which is 

0 0 0 0 0 

10 111 l 

10 10 1 \ 

1 0 1 1 0 6 


The problems here, of course, is that the identifier 
contained characters which were available in the 6 level 
code but were not available in the 5 level code. 


(t 


This scheme has the advantage that it is easy to understand In 
terms of hardware, the transform from * larger set to a smalle 
set can be accomplished by simply ignoring the most significant 
bit position (or positions). The transform f: r ° m * (0 or 1) 

a larger set can be accomplished by inserting a fixed bit (0 or 1) 
in the most significant bit position (or positions). 

The disadvantages would be: 

a \ It would be impossible to maintain the "all zero s 

3) character thru the transform (e.g. 000000 >1000000) 

and, at the same time preserve the "all ones" character. 

b) The basic collating sequence could not be maintained 
between sets. 

c) It would not be possible to simply group like subsets. 



Scheme 2 - Explosion 


The scheme would, by means of a simple logical transform, maintain 
Se relationship of basic types of characters. For example, the 
specials would be low, the numerics next, the alphabet next and 
fhe controls high. The content of each of these sections (numbe 
of 6 characters!) ^would vary as the level of the code. The scheme 
is delineated in more detail in Smith s report. 
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MINUTES 

MEETING OF APRIL 26th AND 27th 


A IQfil ' 


The scheme overcomes Che disadvantages of Scheme 1. The trans- 
form from a larger to a smaller set could be accomplished by 
simply ignoring bits as before. However the transform from a 
smaller set to a larger set would depend on the content of one 
(or more) of the lower order bit positions. 

It should be noted that the expansion transform is inherently 
simple, both logically and in terms of hardware. Complexity 
is added, however, if control is included in the 6 bit (and 
smaller) sets. 




Communications 

In examining the communications problem, it appears that an 8 bit 
character might be reasonable as a standard. One of the bits 
would be a parity bit. The structure would call for 64 characters 
which could be printed and 64 controls and special characters (the 
specials in this area would not be printed by standard terminal 
equipment). The printable characters could be identified by exami- 
ning the highest order (7th) information bit. This is essentially 
the Fieldata structure. 

An approach 

In further study of the problem the following scheme was developed. 
The study is not complete and a conclusion was not reached. Dia- 
grammatically it is 


DP Standard 


DP 6 1 Com 
standard 


No controls contains 
alpha, numeric & special 

Controls & certain specials 
added 

^ 8 bit 

Exploded set 

D.P. standard 


(The organization of the eight bit set has not been established. 
Several possibilities exist all of which would not complicate 
the "explosion" transform). 
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BASIC CRITERIA FOR A STANDARD FAMILY 
OF CODED CHARACTER SETS 


“ Introduction 

In considering the problem of a standard which will ' facilitate the interchange 
of information between systems and between systems and associated equipment, 
it is implicit that such interchange must take place on a wide variety of 
devices which may well inherently work on basic codes of differing levels. 

The "standard 11 must be a family of coded character sets in which one member 
set exists for each level (i.e. 4,5, 6,7, & 8 bit code levels). The inter- 
relationship between these family members must be studied and defined so that 
the scheme of communication between devices of different levels is delineated 
and can be accomplished in the most reasonable manner. 

It is understood that certain members of this family will be identified as the 
preferred set for information interchange and communication. Such established 
preference will lead to the evolution of equipment and procedures which will 
further facilitate the interchange and communication of information. 

Reference is made to Fieldata, Section 3.6. Criteria for Alpha-numeric Code 
Selection and the criteria set forth in the EIA Tentative Standards Proposal 
for Basic Character Set Code of May 25, 1960. The 10 points set forth in 
Fieldata are included and all but 2 of the points of the EIA proposal. The EIA - 
points not included are: 

H) The numerals should be contiguous with the alphabet. 

J) The numerics should follow the alphabet. 


Specific Criteria 

1) The family of sets are to facilitate the interchange of information 
between Systems and between Systems and Associated Equipment. 

2) The codes for all symbols included in a set at a given level are to 
. include the same number of bits. 

3) Each coded set is to be structured such that expansion to a larger set 
or contraction to a smaller set can be achieved in a simple and logical 


0 



Basic Criteria-Std. Family 
Coded Char. Sets 



4) Within each standard set each character and corresponding code will stand 
by itself and not depend on- surrounding characters in the string for in- 
terpretation. That code (or codes) identified as a mode shift or escape 
function will meet this condition and will be considered to initiate de- 
parture from the standard set. 


5) The 6, 7 & 8 level sets (i.e., 64, 128 & 256 character sets) are to 
include: 


The alphabet 

The numerals, 0-9 

Certain special symbols 

5.1) A useful 4 bit (16 character) numeric subset is to be defined. 

5.2) The minimum control codes required to implement criteria #1 above are to 
be included. 

5.3) For those levels that include an alphabetic subset, allowance is to be 
made for this subset to contain more than 26 letters. In the larger 
sets, allowance is to be made for the subset to include up to 32 
characters. This provision will allow for incorporation of foreign 
alphabets. 

5.4) In the larger sets, the numeric subset shall be so structured that the 
binary coded decimal codes for 10 & 11 (i.e., the least four significant 
bits) can be assigned symbols and included with the numerals. This 
allowance is to provide for adaptation of the code to foreign problems. 

6) The numerals shall be so coded that the least four significant bits shall 
be the binary coded decimal form of the decimal digit which the code 
represents. 

7) The English alphabet shall be contiguous. A— >2, with the smallest binary 
number of this code area representing the letter "A". 

8) The code consisting of all zeros as transmitted will be reserved for a 
space or blank function. 

9) The all ones code will signify a delete function in paper tape. 

10) Each set will contain an "Escape" character and code. 

11) The simplest and most logical relationship possible shall exist between 
the binary sequence and the collating sequence and, wherever possible 
the two shall be identical. 
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word separator must collate low. 

^Special symbols commonly used in sort keys and certain punctuation symbols 
* must collate low with respect to the alphabet and numerals. 

The basic collating sequence (i.e., the relationship between the numeric, 
alphabetic and special symbol subsets) will be established giving due 
consideration to those sequences used in existing files and will be 
structured such that the existing file sequences can be dealt with. 

The special symbols will be so placed in the set as to simplify their code 
generation by typewriters and similar keyboard devices. 

The set shall be so organized that the simplest possible tests shall be 
adequate to distinguish and identify the basic alphabetic, numeric and 
special symbol subsets. 

If possible, the set will include the COBOL character set. 

A standard character set will be identified for general information inter- 
change and transmission. The remaining members of the standard family 
are available for use in those situations in which the interchange link 
is known and codes of levels differing from the general information inter- 
change and transmission standard can be advantageously employed. 

Consideration will be given to inclusion of suitable means for data 
delineating and format identification in detailing the standard sets. 
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PUNCHED TAPE CODES 


1 . INTRODUCTION 

During the last two years users and manufacturers of computers, 
especially in Britain but also in U.3.A. , have been working together to 
establish codes for punched paper tapes. As an outcome, some standard codes 
have been proposed for 4, 5? 6, 7 8-track tapes. 

The primary aim of such work is to facilitate the interchange of 
tapes between users of different computers; probably this will be of most 
value for tapes conveying data or intermediate results, but it will also be 
relevant if common programming languages come into use. Secondly, this 
work should unify and simplify the development of associated equipment, etc. 

In preparing any standard code, it is necessary to seek a proper 
balance between con^atibility between users, and flexibility to accomodate 
differing requirements. Accordingly, only those characters and functions 
considered most important are defined, and blank places are left in each 
code to be used as required. 

Attention has been concentrated on choosing symbols for data, rather 
than for programs; and it has been considered important to provide 10 and 
•'ll as single characters for pence. 

The proposed standard codes are shown in the attached tables, and 
are followed by certain other important tape codes. 

In the tables, the tapes are viewed from the top, and are shown moving 
towards the user. This is the way the more important items of tape editing 
equipment are arranged, and it corresponds to movement from right to left 
in an ordinary computer input reader. The codes are arranged so that tapes 
may be placed in the computer input reader with the 3-track side (with 
respect to the sprocket holes) nearest the operator, and correspondingly 
in other machines. 

Against each character is shown its value, obtained by regarding 
each combination of holes and no-holes as a binary number; in general, a 
hole is treated as a binary 1 , and no-hole is a binary 0. 

The tables are arranged in zones, each with i6 positions; the 4 
less-significant bits of each character determine its position within the 
zone, and the remaining more-significant bits determine its zone. 

There follow some comments on the various codes, drawing attention 
only to those points which it is thought may most interest users of Ferranti 
computers. 


2. PROPOSE D STANDARD 4-TRACK TAPE CODE 

This is related to the proposed standard 3 -track computer tape code. 
The tape has the same physical dimensions and layout as 5-track tape, with 
one track not used. It is primarily intended for lowest— cost equipment. 
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1 . INTRODUCTION 

During the last two years users and manufacturers of computers, 
especially in Britain but also in U.S.A. , have been working together to 
establish codes for punched paper tapes* As an outcome, some standard codes 
have been proposed for 4, 5» 6, 7 and 8-track tapes. 

The primary aim of such work is to facilitate the interchange of 
tapes between users of different computers; probably this will be of most 
value for tapes conveying data or intermediate results, but it will also be 
relevant if common programming languages come into use. Secondly, this 
work should unify and simplify the development of associated equipment, etc. 

In preparing any standard code, it is necessary to seek a proper 
balance between compatibility between users, and flexibility to accomodate 
differing requirements. Accordingly, only those characters and functions 
considered most important are defined, and blank places are left in each 
code to be used as required. 

Attention has been concentrated on choosing symbols for data, rather 
than for programs; and it has been considered important to provide 10 and 
il as single characters for pence. 

The proposed standard codes are shown in the attached tables, and 
are followed by certain other important tape codes. 

In the tables, the tapes are viewed from the top, and are shown moving 
towards the user. This is the way the more important items of tape editing 
equipment are arranged, and it corresponds to movement from right to left 
in an ordinary computer input reader. The codes are arranged so that tapes 
may be placed in the computer input reader with the 3-track side (with 
respect to the sprocket holes) nearest the operator, and correspondingly 
in other machines. 

Against each character is shown its value, obtained by regarding 
each combination of holes and no-holes as a binary number; in general, a 
hole is treated as a binary 1 , and no-hole is a binary 0. 

The tables are arranged in zones, each with -|6 positions; the 4 
less-significant bits of each character determine its position within the 
zone, and the remaining more-significant bits determine its zone. 

There follow some comments on the various codes, drawing attention 
only to those points which it is thought may most interest users of Ferranti 
computers. 


2 . PROPOSE D STANDARD 4- TRACK TAPE CODE 

This is related to the proposed standard 5-track computer tape code. 
The tape has the same physical dimensions and layout as 5-track tape, with 
one track not used. It is primarily intended for lowest-cost equipment, 
the tape perhaps being punched as a by-product of working an office machine. 
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3. PROPOSED STANDARD 5 -TRACK COMPUTER TAPE CODE 

This follows the Elliott (405) tape code, when in its form with 10 and 
11. While being quite good for computer use, it has the same Figure Shift, 
Letter Shift, and Who Are You as the Telex code; these may be inportant for 
data transmission over normal telegraph circuits. 

Numbers and some important symbols and functions have an odd number 
for holes for checking; then the 5th bit is removed leaving numbers in proper 
sequence (as in Pegasus/Mercury Code). The code has not, however, the 
optimum checking characteristics; in particular, five of the numbers can 
degenerate into No Action by the loss of a single hole. 

Making Letter Shift and Erase the same is contrary to Ferranti practice 
but it is normal for Telex and for Elliott working, and Elliotts have pro- 
gramming techniques and conventions to deal with it. 

5-track tape is ll/l6ths inch wide; somtimes the 5 tracks are punched 
in 7/8ths inch wide tape, the extra space (on the 2-track side) being used 
for printing. 

It seems unlikely that Ferranti systems will use this tape code much, 
although it can be accepted on all Ferranti computers (except Sirius), using 
suitable input/output routines. 


4. PROPOSED STANDARD 6 -TRACK TAPE CODE 


The 6 tracks are punched in tape of the same physical dimensions 
(7/8ths inch wide) as 7- track tape, one track not being used. The code 
has no inherent checking properties. 

Because of these points, it is difficult to visualise this 6 -track 
punched tape being much used in practice. However , the code itself is 
designed to b e particularly suitable for use within computing systems, 
especially for commercial work. The theory lying behind it is described 
in "Considerations for Choosing a Character Code for Computers and Punched 
Tapes", Ross, The Computer Journal, Vol.3, No.1, January i960, p.202 
(reprints available). 

This basic 6-bit data code is the basis of the proposed standard 7 
and 8-track tape codes, and is used inside Orion and Atlas systems. It has 
proved suitable for a wide variety of applications, is applicable to many 
peripheral and associated machines, and is capable of development in many 
ways including data transmission.. 

The code has the more important symbols and functions in a single 
shift. In addition, the Shift Out character permits a shift out of this 
set of symbols into any other, and Shift In permits a return into this set 
of symbols. These are locking shifts. 

The Escape character, which is of great importance, means that the 
next single character has a special meaning, i.e. "escaped" out of the 
standard code. It is non-locking shift character. One use is that the 
next character should be treated as a special function, or as a warning 
character. It may be used for ignoring a block of faulty data, or in 
connection with data- verification. Another application is that the next 
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character means "shift into another case or code and stay in it", i.e. a lock- 
ins-shift. Thus it may he used to obtain compatibility between many existing 
and future codes, and it permits virtually unlimited extension of a standard 

code. 

ARQ means request for re-transmission, used in certain systems of data 
transmission, in the event of an error. 


5 . PROPOSED STANDARD 7-TRACK TAPE CODE 

This is the basic 6-bit code laid out on 7-track tape with the addition 
of an odd-parity bit in track 5* 


7-track tape is 7/8ths inch wide. 

Probably the dominant justification for using 7 - track tape at present 
is that some Creed tape editing equipment can be obtained, representing very 
R ood value for money. Such equipment - keyboard punches, verifiers, and 
tape copiers - would be suitable for routine punching of data m quantity; 
but such tape editing equipment, for use with Ferranti computers, does not at 
present incorporate a teleprinter (c.f. the Orion/Atlas Flexownter 7-track 
tape code, below). 




6. PROPOSED STANDARD 8-TRACK TAPE CODE 

This is the basic 6-bit code laid out on 8- track tape (-1 inch wide) 
with the addition of an even-parity bit in track 5 and a hole in track 8 
to mean lower case. 


In a computer with 6-bit characters, the upper and lower cases would 
probably be dealt with by introducing case-shift characters, the necessary 
conversion from the 8th track being done by input/output routines. However, 
in a computer with 8-bit characters, the characters from tape can be accepted 
directly; the purpose of making the lower-case hole contribute value 2 in- 
side the computer is to interleave upper and lower case letters properly 
for sorting, and to permit lower case to degenerate into upper case, giving 
the basic 6-bit code, if the least-significant bit is ignored. 


It is probable that many users will wish to replace the . suffix 
numbers in zone J \ , lower case, by other symbols, e.g. programming or 
commercial signs; these would be too varied to warrant standardisation. 

A tape editing set (BIMA) based on an IBM typewriter, and a keyboard 
perforator and verifier (Creed), using this 8-track tape code, are like y to 
become available during l96l. There is little doubt that this code will b 
widely exploited in the next few years. 


INTERNATIONAL TELEGRAPH ALPHABET NO. 2; TELEX 5-TRA.CK TAPE CODE 


This has been arranged in the table in two forms; first, arranged with 
the characters on tape in the same sequence as the other codes; secondly, 
arranged in the standard CCITT sequence. 


This code does not use the concept of a value being associated with 
each track and, therefore, no value can be given to each character (considered 
as a binary number). 
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However, each combination of holes in tape is given an identification 
number, as shown in the second form of the table. 

In telegraphy, each character is serialised, track 1 being transmitted 
first and track 5 last. 


8. EEG-ASUS/MERGURY/SIRIUS 5 -TRACK TAPE CODE 

Because so much tape editing and printing equipment with this code is 
in use, arrangements have been made far it to be read into Orion and Atlas. 

A table lookup procedure will be used to convert to the basic 6-bit code 
inside the computer. 

Note that the significance of the tracks in this code are reversed 
from all other codes - it is unique in this respect; this was one of the 
first codes to be designed for computers, and it followed the track-numbering 
of the CCITT code. 


9. PEGASUS 4-CASE FLEXOWRITER, 5-TRACK TAPE CODE 


A variant of the Pegasus 5-track code which permits the full exploita- 
tion of the facilities of a Flexowriter. Intended primarily for high-quality 
printing of commercial results, with upper and lower-case letters, red and 
black, wide documents, etc. 


10. ORION/ATLAB FLEXOWRITER 7 -TRACK TAPE CODE 

This is the primary tape used for input and output for Orion and Atlas, 
both for program and for data. 

The code is a minor variant of the standard 7-track tape code, with 
upper and lower-case working, the main difference affecting the symbols 
( ) $£ ? & * which have been placed in the lower-case of zone 1. On input 
to the computer, these symbols are transferred to zone 0, by dropping the 
5th bit from track 6. And conversely on output. This arrangement overcomes 
the restriction of the limited number of printing keys (44) on the keyboard 
of Flexowr iters or other typewriters. 

The table shows alternative symbols for certain tape-characters. 

Those for preparing Orion programs are shown first in each pair; the al- 
ternates are for Atlas programs, or for a typical machine for preparing 
commercial data. For the latter applications some users may require special 
symbols, and these should preferably be allocated in conformity with the 
following rules: 

(i) The symbols of the basic 6-bit code should be retained; 

(ii) Any symbol shown in one place in the table should not be moved to 
another . 

(iii) Any symbol not wanted may be replaced by another not already shown. 
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Further, some special machines have already been designed ^using the ^ 
following code-values:- 59, 60, 6 l for Swedish letters A, A , 0 , or for 
f, respectively; and for Flexowriter functions, 8 = change readers, 

9 = change colours, 10 = non— print, 11 = print— restore. 

The Creed equipment used for 7 - track tapes with Orion and Atlas is 
arranged to punch in this code, rather than the standard 7~ track tape code, 
so that tapes will be compatible with Flexowriter tapes. 


HMcGR/EL 
11 .4.61 
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TkACK Noi>. 4 3 21 

Values $ 421 
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PROPOSED Standard 5- Track Tape Code 


Track Nos. 
Valdes 

v < 

tape 

floVEMEKT 


if 


Top 

View 


lluc 

0 

1 
2 

3 

4 

5 
£ 

7 

s 

9 

IP 

11 

12 
13 
It 
15 



0 


• • 

• • 
00 
0 00 


00 

0 

0 0 


0 
0 
0 

0 '000 


Fi SURE LETTER 

SHIFT SHIFT 


No Act i<w 

1 A 

2 6 

C 


4 


7 

8 

4 

11 


0 


D 

E 

F 

<i 

H 

I 

J 

K 

L 

n 

N 


0 



54 3 2 1 

148 42/ 

Figure letter 

Vitl HC 

^ 

SHIFT SHIFT 

It 

0 > 

O P ; 

17 

0 0 

Q 

Ilf 

0 ’ • 

WHo A IC£ 

YOU K 


• ' 00 

3 S 

* 

0 • 0 

T 

21 

0 '0 0 

5 U 

77 

0 '0 0 

6 V 

2J 

0 '0 00 

W 

24 

00 • 

X 

*5 

00 • 0 

9 Y 

it 

00 ' 0 

10 z 

27 

0 0 ' •• 

Figure Shift 

If 

00' 0 

Space 

19 

00 '0 0 

Carriage Return 

3c 

0 0 '00 

Line Feed 

31 

00 ■••• 

Letter Shift/Erase 


I 

-J 

I 


mc&i 

Jo- Z3 !%[ 



proposed Standard 





Zotff 0 


Zone 

TkacK Mm 

4 

654 32/ 

■ 


654 3 2 1 

Values 

tin 

JZlitf 4 2/ 



aifcf 4 2/ 
s ' s ’ 




■ . 

Wk£ 




0 

I 

$ PAC£ 

II 

• 



1 

• 


17 

• # 

TftfF 

2 

• 

Neuunc on CR 

If 

• * 

■ fill* 

Moi/flf£Nr 

3 

# # 

PAPfRTHEOW 0* IF 

19 

• • © • 



4 

» # 

Tabulate 

2° 

• • • 



5 

1 • # 

SfitOiSMCE 

21 

• < # © 



6 

• # # 

Shift Our 

Z1 

9 '00 



7 

•• •# 

Shift /n anj> Runout 

21 

0 >000 


/ 

» 

• ■ 

( 

24 

0 0 > 



<) 

# * # 

) 

IS 

• • • # 



10 

# 9 # 


26 

00 ' 0 

Top 

II 

# • «» 

i. 

n 

0 0’ mm 

View 

ll 

e » # 


u 

00 ' 0 



13 

# • • § 

So 

» 

00*0 0 



14 

# * ## 

* 

}o 

0 0-00 



15 

# « # # # 

/ 











6 - Track Tape Code 




Zone 

z 


Zone 

3 



654 32 1 



654 321 




fl ll 8 4 2 I 



frill 4 2 1 



U* 1 

"" ' — — 


u* 

V 

i 

0 

32 

• 


4< 

00 • 

P 

1 

ft 

0 0 

A 

44 

0 0 0 

Q 

i 

3/ 

0 4 

6 

s- 

0 0 • 0 

K 

3 

35 

0 10 0 

C 

SI 

0 0 0 0 

S 

4 

34 

0 10 

D 

Si 

0 0 ' 0 

T 

5 

27 

0 1 0 0 

£ 

S3 

0 0 ' 0 0 

0 


3f 

0 <0 0 

F 

SH 

0 0 -00 

V 

7 

ft 

0 '0 0 0 

<i 

ss 

00 '000 

00 

w . 

% 

4* 

0 0 • 

H 

ft 


X 

9 

4/ 

0 0’ 0 

I 

S7 

000 ' 0 

Y 

10 

42 

0 0 ' 0 

J 

ft 

0 04 1 0 

z 

11 

4) 

0 0 • 0 0 

K 

S9 

000 • 00 



44 

0 0 > 0 

L 

4* 

0 00 • 0 


+ 

4* 

0 0 ’ 0 0 

M 

*/ 

0 00 ' 0 0 

1 

ARQ 

— 

44 

0 0’0 0 

N 

• 2 

000 '00 

Escape § 

0 

47 

0 0 I • 0 0 

0 

45 

000 '000 

Ekase ^ 


K ZC (U 

ZJ IHU 



PROPOSED 

Zovf O 


STANDARD 7- TRACK 

I Zone 1 I 


TAPE CODE 

Zone Z 


Zone 3 


TkAcK <V<*. 


7454 321 



7454 32 f 

SlftLUES 


miPt H2\ 



3211,91 42/ 



l4lm 

r— — 


iLU 

r> — ' — 



0 

• 

SpACc 

1 b 

0 



1 

0 


17 

0 0 


2 

• 0 

hlEULINE OR CR 

IS 

00 ' 0 

IAP£ 

3 

0 '00 

fyfemRou or LF 

If 

0 '00 

Mo m&T 

4 

' 0 

Tabulate 

> 

00 ' • 



? 

0 ' 0 0 

Backspace 

21 

0 '00 



4 

0 '00 

Shift Out 

22 

0 '00 



7 

10 0 0 

Shipt In and Runout 

23 

00 ' 0 00 

4 

r 

* 

■ 0 • 

( 

24 

* • * * 

f 

4 

00 • 0 

) 

2/ 

• • ■ 0 



10 

00 • 0 


26 

0 0' 0 

(op 

K 

0 ■ 0 0 

l 

27 

000 ' 00 

\/<cW 

ft 

00 • 0 


2* 

0 0 • 0 



(3 

0 * • 0 

So 

2f 

00 0 ’ 0 0 



14 

0 '0 0 


Jo 

000 ' 00 



If 


/ 

Jf 

0 0 '00 0 


0 

1 

2 

3 

4 

5 

<o 

7 

8 

9 

10 
11 


+ 


7454 321 
32ILV1 421 
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FRED W. SMITH, Assistant to Telegraph Equipment Engineer 

A Proposed New Teleprinter Code 
for Error Detection 


In recent years a considerable amount 
of study has been devoted to error-detec- 
tion and error-correction systems for tele- 
graph and data communications. A 
number of articles on this subject have 
appeared in Technical Review and many 
papers on the subject are presented each 
year before various engineering societies. 

Emphasis has been placed on error de- 
tection, since a detected error can always 
be corrected, and also on use of systems 
and methods for data communication, 
where an undetected error can prove 
costly. The various codes used in telegraph 
and data communications have received a 
great deal of attention and many new 
codes have been developed for special pur- 
poses, with emphasis on error detection 
and error correction. The 5-unit Baudot 
code, long recognized as illogical in many 
respects, has also been studied 1 with a 
view to improving the character assign- 
ment to provide some protection against 
undetected errors. To date, no 5-unit code 
with an improved character assignment 
has been used in the United States, but 
such a code is now in limited use in 
Europe. 

The 8-Level Data Code 

In data communications there is a defi- 
nite trend towards use of codes having 
more than five intelligence levels, or 
pulses, and some communications special- 
ists feel that the 8-level code will some 
day be the standard code for data and 
message communication. One 8-level code 
which appears to be favored for this pur- 
pose at present would utilize five of the 
intelligence levels to obtain 32 possible 
code combinations. A sixth level would be 
used as a shift level; that is, to distinguish 
between upper-case (figures or symbols) 
and lower-case (letters) printed charac- 
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ters. A seventh level would be used for a 
vertical parity check; that is, a seventh 
marking pulse would be automatically 
added where necessary to insure that 
every combination contained an odd (or 
even) number of marking pulses. An 
eighth level would be used for control 
purposes and could be utilized for any 
purpose required. The five levels in this 
code used for obtaining the 32 code com- 
binations could employ either the present 
Baudot code assignment or a new and 
more logical assignment to provide some 
protection against undetected errors. 

Use of an 8-level code to obtain only 32 
possible code combinations for selecting 
printed characters or teleprinter functions 
appears to be highly inefficient because of 
the increased line frequency required for 
a given speed of operation and because of 
the more complex mechanical apparatus 
required to handle an 8-level code. The 
primary purpose of this article is to pro- 
pose at least one attractive alternative to 
the use of an 8-level code in future tele- 
graph systems, and to attempt to stimulate 
further study of a possible standard code 
for future telegraph and data communi- 
cations use. Since adoption of a new 
8-level code in the telegraph industry 
would involve extensive design of new 
equipment and/or redesign of existing 
equipment, a most careful and thorough 
consideration of all factors involved is 
justified in order to avoid placing undue 
limitations on the versatility and flexibil- 
ity of any new system developed. 

The principal advantage in using a sixth 
level as a shift level appears to be that it 
makes possible a keyboard more like a 
typewriter keyboard than any teleprinter 
keyboard now in use; that is, a 4-row key- 
board which permits transmitting a num- 
ber without first operating a figures-shift 
key can be designed. To send an upper- 
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case character other than a number, how- 
ever, it would be necessary simultane- 
ously to depress a shift key and a 
character key in order to insert the sixth 
marking pulse. This might be attractive to 
customers who do not use the present 3- 
row teleprinter keyboards. 


to preparation of his article, Steeneck de- 
veloped a “Christmas Tree” analysis chart 
of all binary codes up to and including the 
9-level code. Steeneck’s analysis chart 
(Figure 1) shows, for each code, the num- 
ber of combinations possible for each 
number of marking pulses (or bits) pos- 


0- BIT 
CODE 

1 - BIT 
CODE 

2- BIT 
CODE 

3- BIT 
CODE 

b- BIT 
CODE 

5- BIT 
CODE 

6- BIT 
CODE 

7- BIT 
CODE 

8 -bit 

CODE 

9- BIT 
CODE 









1 (O-BIT) 








1 (O-BIT) 







1 (O-BIT) 

9 ( 1 -BIT) 






1 (O-BIT) 

80 -bit) 





1 (O-BIT) 

70-BIT) 

36 ( 2 -B 1 T ) 




1 (O-BIT) 

6(1 -BIT) 

28 (2-8 IT): 



1 (O-BIT) 

50-BIT) 

21 (2-BIT); 

8 U ( 3 -B 1 T ) 


1 (O-BIT) 

’t(l-BIT) 

!5(2-BIT) 

56 ( 3 -B 1 T ) 


1 (C-BIT) 

30-bit) 

10 ( 2-B IT), 

35 (3-B i T ) 

1 26 (U-B 1 T ) 

1 (C-BIT) 1 

2(1 -BIT) 

6(2-8 1 T ) 

20 (3-BIT) 

70 (U-B 1 T ) 

1 (1 -Bl T) 

3 ( 2 — B 1 T ) 

I0(3-BIT) 

35 (4-b l T ) 

1 26 ( 5 -B 1 T ) 


1 (2-BIT) 

U(3-BIT) 

15(4-8IT) 

56 ( 5 -B 1 T ) 

i 

i ( 3 -bit) 

5 (U-bit) 

21 (5-BIT) 

8 U ( 6 -B 1 T ) 



1 (U- 6 IT) 

6(5-BIT) 

28 ( 6 -bit) 




1 (5-BIT) 

7 ( 6 -b i t) 

36(7-81 T) 





i ( 6 -bit) 

8 ( 7 -b it) 






1 (7-BIT) 

9 ( 8 -b 1 T ) 







1 (8-bit) 








1 (9-BIT) 










Figure 1. Steeneck's analysis chart of binary codes up to nine levels. 


Use of a vertical parity check for each 
character transmitted provides appreci- 
able protection against occurrence of un- 
detected errors and a vertical parity check 
can be made with relatively simple cir- 
cuitry. However, such a check is not 
nearly as effective as a horizontal parity 
check and the use of an additional intel- 
ligence level solely for the purpose of 
providing a vertical parity check deserves 
careful study. In any event, preliminary 
tests indicate the latter to be only 90-per- 
cent effective. 

Control characters are, of course, essen- 
tial in most modern telegraph and data 
communications systems and it is essen- 
tial to provide for such characters in any 
future code developed. Again, however, 
use of one additional intelligence level 
solely for this purpose should be carefully 
evaluated. 

A Proposed New 6-Level Code 

In a recent article in Technical Re- 
view’ Robert Steeneck pointed out some 
of the error-checking possibilities con- 
cealed within the 5-level code. Subsequent 


sible. For example, in a 2-level binary 
code there is one combination which con- 
tains no marking pulses, two combinations 
which contain one marking pulse, and one 
combination which contains two marking 
pulses. While it is not possible to have a 
binary code containing zero levels, Stee- 
neck added a theoretical zero-level code 
for esthetic reasons. 

The possibilities revealed by the 
Steeneck chart and the design work now 
being done on at least one 6-level tele- 
printer led to consideration of the possible 
use of a 6-level code with the sixth level 
being used for purposes other than shift 
control. Referring to Figure 1, it can be 
seen that the 6-level binary code contains 
6 combinations with a single marking 
pulse, 20 combinations with “triples,” and 
6 with “quintuples.” There are 15 “dou- 
bles” and 15 “quadruples.” The combina- 
tion with no marking pulse and the one 
with all 6 pulses marking complete the 
64 combinations possible with a 6-level 
binary code. 

Note that there are 32 combinations 
which contain an odd number of marking 
pulses and 32 which contain an even num- 
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her, if zero is considered an even number. 
It is thus possible to obtain 32 combina- 
tions of the 6-level code, each of which 
will have an odd number of marking 
pulses, or 32 combinations, including the 
all-spacing combination, each of which 
will have an even number of marking 
pulses. This, of course, is the same number 
of combinations obtained with the stand- 
ard 5-unit Baudot code. 

Figure 2 shows all of the combinations 
in the fi-level code which contain an odd 
number of marking pulses, and Figure 3 
shows the combinations which contain an 
even number of marking pulses. (Note 
that the code levels in both figures have 
been numbered from 0 to 5, rather than 
1 to 6, to correspond to the numbering sys- 
tem recommended by the AIEE Commit- 
tee on Tape Standardization.) Thus, either 
of these two groups of combinations could 
be used to obtain the 32 different charac- 
ters now commonly used in telegraphy, 
and either group would provide a built-in 
parity check, either odd or even, depend- 
ing on the group chosen. If either of these 
two codes is adopted for future record 
communications use, a serious effort 
should be made to make it a “common 
language” code, equally useful for mes- 
sage and data communications. Some of 
the factors which should be studied will 
be discussed in the following paragraphs 
in an effort to determine which of the two 
proposed codes appears more attractive. 


Protection against errors can be pro- 
vided to a considerable degree by careful 
assignment of the most frequently used 
characters to the code combinations least 
susceptible to errors. In making such an 
assignment, the following factors are most 
significant: 


1. A decision should he made as to which 
characters should be given maximum 
protection and an order of priority 
established for the 32 characters. 


The many types of errors which occur 
in telegraph communications and their 
frequency of occurrence should be 
determined and an order of priority 
of the code combinations established, 
based on the frequency of occurrence 
of the most common types of errors. 


The two orders of priority should then 
be matched to obtain the most desir- 
able code assignment. 


Perhaps there are other factors 
which could be considered but these 
appear to be the most important. 


Several years ago, Western Union made 
a study of the frequency of occurrence of 
the letters of the alphabet in the English 
language. Although this study was not 
based on telegraph messages, but rather 
on newspaper stories, it is unlikely that 
telegraph messages would be radically 
different. The order of occurrence of each 
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Figure 2. A 6-level binary code using only combinations with an odd number of marking pulses. A number denotes 
a marking pulse and absence of a number denotes a spacing pulse. 
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letter, with the frequency of occurrence 
per 1000 letters, was: 


Letter 
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WOO Letters 

Letter 
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1000 Letters 
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Considering only the letters of the 
alphabet, a priority based on the foregoing 
tabulation could be assigned. However, in 
telegraph and data communications, pro- 
tection of numbers is of utmost impor- 
tance, since it is difficult to detect an error 
occurring in a number whereas an error 
in a letter in normal English can usually 
be detected and corrected. Also, protec- 
tion of the five teleprinter functions (car- 


riage return, line feed, figures shift, let- 
ters shift and space) must be considered. 

It would be highly desirable to assign 
the ten numerals to the ten most fre- 
quently used letters in the alphabet, so 
that maximum protection could be given 
simultaneously to the numbers and to the 
ten most frequently used letters. Unfor- 
tunately, this would involve a radical 
change in the layout of 3-row keyboards. 
With a 4-row keyboard, the change could 
be made without seriously affecting the 
present standard layout, since the num- 
bers keys would be separated from the 
letters keys and the code combinations 
assigned to the numbers could correspond 
to any group of ten letters. 

Reassignment of the numbers would 
make it necessary to change some of the 
punctuation marks, but this could prob- 
ably be tolerated. On the other hand, the 
3-row keyboard is based on the physical 
relationship between the numbers and 
letters keys on standard typewriters and 
any change would make it difficult for a 
touch typist to use the touch system on 
such a keyboard. For purposes of this 
discussion, then, it will be assumed that 
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Figure 3. A 6-level binary code using only combinations with an even number of marking pulses. A number 
denotes a marking pulse and absence of a number denotes a spacing pulse. 
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•i-row keyboards will continue in use in 
telegraph communications and that the 
numbers cannot be assigned to other let- 
ters; i.e., the present assignment must be 
maintained. The numbers one through 
zero are at present assigned to the letters 
QWERTYUIOP, in that order. The letters 
QWYUP are not among the ten most fre- 
quently occurring letters of the alphabet, 
but the remaining five in the “numbered” 
group are. All ten of the letters in this 
group must be protected, however. 

An “average” word consists of five let- 
ters, so that a teleprinter must perform a 
spacing function an average of one time 
m every six operations of the teleprinter. 
The frequency of occurrence of the spac- 
ing character is thus approximately 200 
times per 1200 characters and this fre- 
quency is even higher than that of the 
letter E. Although loss of, or an error in a 
spacing character seldom results in an 
undetected error, this character does 
deserve a high priority in the code assign- 
ment because of its high frequency of 
occurrence. Errors in the figures and 
letters shift characters seldom result in an 
undetected error and they are less fre- 
quently used than the space; therefore 
these characters deserve less priority than 
the space. However, it is more difficult to 
correct errors due to shift failure without 
resending the message than it is to cor- < 
rect a spacing error, so that these two im- i 
portant function characters should be t 
given some priority over little-used char- c 
acters. Similarly, the carriage return and t 
line feed characters seldom result in an a 
undetected error but a single failure of r 
one of these two characters can result in ii 
making an entire line of printing unintel- p 
hgible. They thus deserve a rather high c. 
priority, perhaps higher than that assigned o< 
to the letters and figures characters. tc 

Considering the three foregoing factors tc 
(that is, frequency of occurrence of let- P’ 
ters, protection of numbers, and protection cJ 
of functions) , a number of logical orders P* 
of the priority of the 32 characters now lo 
used could be developed, depending on the ta 
weight given to each factor. The following 
order is suggested as just one possibility; to 
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Priority 

Character 
L.C. U.C. 

Priority 

Cha 

L.C 

i . 

Space 

17... 

A 

2 

E 3 

18 . . 

s 

3 . . 

T 5 

19 

D 

4 

R 4 

20 

. L 

5 

1 8 

21. . . 

C 

6 

O 9 

22. . . 

H 

7 

u 7 

23 

F 

8 

P 0 

24 . 

M 

9 

Y 6 

25 

G 

10 

W 2 

26 

V 

11 

Q l 

27 . . . 

B 

12 

Car. Ret 

28 . 

X 

13 

Line Feed 

29 

K 

14 . 

Figures 

30 

J 

15 

Letters 

31. 

z 

16 

N , i 

32 

Blank 


} T yP es of Errors and Frequency 

" w A * nUm rf r ° f studies have been made by 
. Western Union to determine the types of 
, errors and their frequency of occurrence 
in messages switched over the public mes- 
sage network and over one private wire 
network. The statistics resulting from 

ese various studies do not agree in all 
respects but there is agreement as to 
which types of errors occur most fre- 
quently. 

Loss of a single marking pulse is the 
most common type of nonoperator error 
and the field studies reveal that from 37.7 
o percent of all nonoperator errors are 
due to loss of a single pulse. The first pulse 

^ l0 fif + T 0re , freqUently than ar W ot ber and 
the fifth pulse ranks second in frequency 
ot loss. In normal message traffic each of 
these two pulses occurs in 40 percent of 
all characters transmitted. The third pulse 
ranks third in frequency of loss and occurs 
in 53 percent of all characters. The second 
pulse, which occurs in 48 percent of all 
characters, and the fourth pulse, which 
occurs in 40 percent of all characters, 
together account for fewer total errors due 
to loss of a single pulse than does the first 
pulse. The studies also revealed that a 
character containing a single marking 
pulse is more susceptible to error due to 
loss of a single pulse than a character con- 
taining several marking pulses. 

Gain of a single pulse caused from 4 9 
to 14.3 percent of all errors. (These results 
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do not agree with controlled laboratory 
tests made on marginal equipment and cir- 
cuits, which show that 90 percent of all 
errors 1 are due to gain or loss of a single 
pulse.) The remaining errors were caused 
by simultaneous loss and gain of pulses, 
loss of an entire character, gain of mul- 
tiple pulses, and loss of multiple pulses, in 
that order of frequency. 

It seems logical to assume that in a 
0-level code the first and sixth pulses 
transmitted would be most susceptible to 
loss, rather than the first and fifth. If this 
assumption is accepted, any 6-level code 
adopted should assign the lowest possible 
priority to combinations containing both 
of these pulses and the next lowest to 
combinations containing only one of these 
pulses. Referring to Figure 1, it can be 
seen that a 4-level code contains six com- 
binations with two marking pulses, one 
with four marking pulses, and one with 
no marking pulses. Therefore, a 6-level 
even-parity code will contain eight com- 
binations with an even number of marking 
pulses, without either a first or sixth 
marking pulse, if the blank combination 
is included. However, use of the blank 
combination is considered undesirable and 
it is seldom used. Figure 1 also reveals 
that eight combinations with an odd num- 
ber of marking pulses are available with- 
out using either the first or sixth pulse. 

A high degree of protection against un- 
detected errors in numbers and vowels 
should be provided by putting all numbers 
and all vowels in the same group; that is, 
in a group of combinations each of which 
contains the same number of marking 
pulses 1 . Since 4 of the 5 vowels have 
numbers assigned to the upper case, only 
11 combinations in a group are necessary 
to accomplish this; it could be done with 
either the odd-parity or even-parity 6- 
level code. In the odd-parity code shown 
in Figure 2, only “triples” could be used. 
In the even-parity code shown in Figure 3, 
either “doubles” or “quadruples” could 
be used. In the odd-parity code there are 
only 4 combinations of “triples” which 
contain neither a first nor a sixth pulse. 
In the even-parity code there are 6 
“doubles” which contain neither a first nor 


a sixth pulse and only one “quadruple” 
which contains neither of these two pulses. 
Use of the even-parity code with the num- 
bers and vowels assigned to the “doubles” 
therefore offers some advantage over use 
of the odd-parity code. The following code 
assignment is therefore proposed for the 
even-parity code, considering all of the 
factors previously discussed: 


Combina- 

Priority lion No. Code Character 

Number (Pig. 1 ) Combination L.C. U.C. 


1 6 -12 Space 

2 . .... 7 -1-3 E 3 

3 ..... 8 -1 4 — T 5 

4 10 2 3 . R 4 

5 11 2-4-1 8 

6 13 3 4-0 9 

7 26 - 1 2 3 4 -.. Figures 

8 9 -1 5 U 7 

9 12 2 5 . p 0 

10 .14 3 - 5 Y 6 

11 15 45 W2 

12 1 0 1- Car. Ret. 

13 . ... 2 0 — 2 Line Feed 

14 3 o - - 3 - - Q 1 

15 ... 4 0 4 - A _ 

16. 27 - 1 2 3 - 5... Letters 

17 28 - l 2 - 4 5 N 

18 29 - I - 3 4 5 S’ 

19 . 30 2 3 4 5 ... D $ 

20 .16 0 1 2 3 - - . L ) 

21 . .... 17 012-4-...C : 

22 ,19 0 1 - 3 4 -...H # 

23 22 0 - 2 3 4 -...F fl 

24 18 0 1 2 5 M . 

25 20 0 1 — 3 — 5. 0 & 

26 21 0 1 - - 4 5 V ; 

27 23 0 - 2 3 - 5 B ? 

28 24 0 - 2 - 4 5. . X / 

29 25 0 3 4 5 K ( 

30 31 0 1 2 3 4 5 . J Bell 

31 5 0 5. Z ” 

32 32 Blank 


Note that the priority previously given 
was modified to place all of the numbers 
and all of the vowels in the “doubles” 
group, so that loss or gain of a single pulse 
or of two pulses would not result in print- 
ing a wrong number or a wrong vowel. 
The seventh ranking combination in the 
order of priority was then available for a 
character other than a number or vowel. 
The figures shift was assigned to this com- 
bination because of its importance in data 
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communications. Unfortunately, this re- 
sulted in a relatively low priority for the 
letters N, A and S, all three of which are 
among the ten most frequently occurring 
letters in the alphabet. 

The above priority list can be divided 
logically into three groups. The first group 
consists of priorities 1 through 7 , which 
do not contain either a zero or fifth mark- 
ing pulse. The space and 6 printing char- 
acters in this group would account for 649 
out of every 1200 printing and spacing 
characters in normal message traffic. The 
second group consists of priorities 8 
through 23, each of which contains either 
a zero or fifth pulse, but not both. The 
pi inting characters in this group would 
account for 473 out of every 1200 printing 
and spacing characters. Also, 3 of the 5 
function characters are in this group. The 
third group consists of priorities 24 
through 32 and each of these combinations 
contains both a zero and a fifth marking 
pulse. The characters in this group would 
account for only 18 out of every 1200 
printing and spacing characters and there 
are no function characters in this group. 

Perhaps a more exhaustive analysis of 
the error protecting possibilities in a 6- 
level code would result in considerable 
improvement in the foregoing code assign- 
ment, but this one at least would be a 
tremendous improvement over the 6-level 
code now in limited use in the telegraph 
industry, where the sixth level is used 
only for shift-unshift, a vertical parity 
check is not possible, and the code assign- 
ment is more or less haphazard. 

The proposed code would prevent a 
printing error due to loss or gain of a 
single pulse. The 15 “doubles” combina- 
tions would prevent a printing error due 
to loss of two pulses, and the 15 “quad- 
ruples” would result in printing a J (or bell 
in upper case) if two pulses were gained. 

In most cases, a J (or bell symbol) errone- 
ously printed for another character would 
be an obvious error and therefore would 
not go undetected. All of the combinations 
would be protected against undetected 
loss or gain of more than two pulses. This 
code offers no protection against simul- 
taneous loss and gain of the same number 
of pulses within a character combination. 


This could happen occasionally if the error 
rate was unusually high and the chances 
of gam and loss of pulses were about 
equal. In practice, however, the error rate 
is never that high on an operating circuit 
and error conditions usually consistently 
produce pulse losses or, more rarely, con- 
sistently produce pulse gains. Therefore, 
the probability of compensating errors 
occurring in this type of error check is 
quite remote. 

It might appear wasteful of transmission 
time to use only 32 combinations out of 
the 64 available with a 6 -level code. How- 
ever, the 32 combinations which contain 
an odd number of marking pulses could be 
used for such purposes as switching con- 
trol or horizontal parity checking. Switch- 
ing characters or horizontal parity check- 
ing characters could be preceded by a 
conditioning code” combination to bypass 
temporarily the vertical parity checking 
circuit and, if desired, to put the tele- 
printer in a nonprint condition. Following 
the switching or horizontal parity check- 
ing combinations, an end-of-switching or 
end-of-parity-check character could re- 
store the circuit and the teleprinter to the 
normal printing condition. The 6 -level 
code used in this manner would make con- 
siderably more efficient use of transmis- 
sion facilities than the 8 -level code previ- 
ously described and would also make it. 
possible to use a higher transmission speed 
for a given bandwidth. 

Mechanical Apparatus for the 6-Level Code 

Design of tape transmitters, distributors, 
distributor-transmitters, keyboard perfo- 
rators, and nontyping reperforators for 
use with the proposed code would present 
no special problems and, in fact, many of 
these units are available for use with a 
6 -level code. They could be used with the 
proposed code with little or no modifica- 
tion. A transmitter or distributor-trans- 
mitter could be used with a parity-check 
circuit which would stop transmission and 
actuate an alarm when the reading pins 
read an odd number of marking pulses. A 
similar circuit could be used with verify- 
ing contacts or reading contacts on a 
reperforator to perform the same function. 


j 
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At present there is no start-stop tele- 
printer or typing reperforator which could 
be readily modified to operate with the 
proposed code, but there appear to be no 
serious obstacles in the design of such 
units. A typebar teleprinter such as the 
Siemens and Halske Type 100- could be 
designed readily for six levels and it is 
possible that a large number of standard 
parts could be used in the 6-level tele- 
printer. A teleprinter using an aggregate- 
motion type of selector would be more 
difficult to design and it is doubtful that 
any of the present aggregate-motion selec- 
tors could be modified to use many of the 
standard parts. These comments also 
apply to start-stop typing reperforators. 

A teleprinter with a 2-color printing 
mechanism such as that used on most 
European teleprinters- could be used to 
detect a parity-check failure by energizing 
a magnet to shift the red half of the ribbon 
to the printing position. The parity-check 
circuit could be arranged to lock the mag- 
net in its operated position until manually 
released, so that all characters following 
the error would print in red. An asterisk 
or other symbol could be printed on re- 
ceipt of the blank character to guard 
against undetected loss of two marking 
pulses in one of the “doubles combina- 
tions. A more detailed study would no 
doubt reveal other possibilities. A tele- 
printer could be designed to space on 
every signal train received, whether a 
valid character or not. This would give 
excellent protection against undetected 
loss or gain of a single pulse, the most 
common cause of all errors, without the 
use of a parity-check circuit. 

Conclusion 

It is doubtful that it will ever be 
economical to design a communication 


system to provide 100-percent protection 
against errors, but it is economically feas- 
ible to design a reliable system approach- 
ing 100-percent accuracy. The 6-level code 
proposed here offers extensive possibil- 
ities. Apparatus designed for use with this 
code would provide a very large decrease 
in both detected and undetected errors, 
without providing parity-check circuitry, 
but it is not possible with the meager 
error-study data available accurately to 
predict exactly how much improvement 
could be expected. 

New carrier systems have been devel- 
oped which considerably reduce transmis- 
sion errors and newer systems which will 
result in even greater improvement are 
under development. Since mechanical 
equipment is the source of a large per- 
centage of errors, a new and drastically 
different approach to error prevention in 
mechanical apparatus is needed to keep 
pace with the improvement being made 
in transmission facilities. It is hoped that 
this article will stimulate a study of this 
need and perhaps lead to a new approach 
to the error prevention possibilities within 
the 6-level code. It is possible that 6-level 
equipment such as that proposed could be 
incorporated into one of the broadband 
switching systems now under develop- 
ment for data communications, with re- 
sulting simplification in the system and 
consequent reduction in cost. 
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EDITOR’S NOTE 

This paper describes a tentative standard code which has been proposed to the 
membership of the ASA X3.2 Subcommittee on Character Sets and Input/Output 
.Media. The X.'i.2 Subcommittee is now examining this work in detail and, in my 
opinion, will probably recommend as an American Standard a code which is quite 
similar in principle to the one described here although it mav differ in some de- 
tails. Because of the urgency of this work and the very great amount of effort 
which has already gone into the development of this proposal, it is presented 
here, informally, for the information and scrutiny of the ACM membership 
although at proof time it had not yet been reviewed by the ACM Standards Com- 
mittee for consideration as an ACM Standard. The authors and this Department 
will welcome your comments. 


Design of an Improved * 
Transmission/ Data Processing Code 


R. W. Bemer, H. J. Smith, Jr., F. A. Williams, Jr. 

IBM Carp., White Plains, .V. Y. 

Historically there has been strong difference of opinion 
in the construction of 6-bit (64-character) data codes, 
based upon whether the code is to be used for communica- 
tions or data processing. This paper reports on investiga- 
tion of an improved code which meets transmission re- 
quirements and requires very little modification for varied 
data processing usage. 

It has been evident from the workings of the ASA 
Subcommittee X3.2 that the transmission people are not 
as adaptable to modifications as the data processing people. 
This is simply a matter of inflexibility of existing semi- 
mechanical communications equipment compared to the 
general-purpose nature of electronic data processing equip- 
ment. 

The major obstacle lies in the collating, or ranking, 
sequence of the characters of the set. It is true that a 
laige proportion of the ordered files of today are sequenced 
on numeric keys alone. However, a substantial proportion 
of these files are ordered on keys which contain alphabetic 
and special punctuation characters. If a standard code 
changes the relative ranking of such characters the pres- 
ently ordered files will all have to be fully reordered to the 
new sequence, a process requiring a great expenditure of 
machine time. Transformation of one bit representation 
to another is relatively simple when the sequencing 
property is ignored. However, one should try to guarantee 
that the files are still in proper order after such conversion 
[ 11 . 
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There are three inputs to the collating problem: , 1 

(1) The most prevalent ranking in the U.S. is tl 
established by IBM equipment, particularly the 705| 
order are the blank, special characters, the alphabet, ! 
digits. The critical point here is that the digits are higi 
than the alphabet, for whatever reason. The United ! 
dom and certain other U.S. manufacturers (Sperry . 
and RCA) rank the digits lower than the alphabet. • 

(2) The desire of communications people, as 
evidenced by Fieldata [2, 3, 4], is to have the 6-bit; : 
collapsible to a 5-bit Baudot-type set with effectively::: 
same characters. This is to utilize existing Baudot-Tele 
equipment with simple modification. 

(3) Certain punctuation characters, by univers 
accepted practice, should collate low to alphabets, i 
and other special characters. For example, the folio 
two names would normally be ordered : ' 

Roberts, A. B. 

Robertson, X. 

whereas the Fieldata code, because the comma i 
higher than the alphabet, would yield an ordering: :■ 

Robertson, X. 

Roberts, A. B. 

Expansion and contraction between any of the 4-, 
i- and 8-bit code sets demand a certain uniformity? 
simplicity. Thus the alphabet should be reserved to 
contiguous quadrants of the four quadrants of the 
set. The choice now appears as in Figure A. 
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In tin; opinion of the authors neither the Fieldata code 
nor the l .K. code meet the criterion for 5-bit Baudot-like 
operation completely, even though that was one of the 
major design requirements. A Baudot type of code is 
formed essentially as follows: 


Digits and special 


Control 


•>2 5-bit combinations 


In any 2-mode code for paper rape, three of the control 
codes, delete, figure shift and letter shift, must in- 
variably be common to both shifts, delete must be all 
l’s (all punched on paper tape) and master space must be 
all 0’s ( unpunched tape), master space, blank, and 
EstAPE preferably appear in both shifts. Such controls as 
lim: feed, carriage return need appear in only one 
s! ifr. but operation is more complicated. 

■""me of these functions may be combined in a single 
code combination, delete letter shift is a single code 
in Baudot, figure shift is synonymous with one of the 
three functions possible to escape. [(i| 

Since dl, fs and ls must be common to both modes, 
fieldata loses the Y and Z of the alphabet and the — and 
+ characters in the collapsed 5-bit mode. This is not 
tolerable because some words are spelled using Y and Z. 
Mmilarly, the L .K. code loses the letters F and G and the 
S' mbols . and . The code developed in this paper is very 
similar to both of these codes but removes these major 
flaws. 

All ot the criteria of the fieldata st-udv are used here 
flic lull spectrum of expansion and contraction among 
-f-, .)-, (i-, 7- and S-bit sets is considered in addition. Thus 
there are the following additional criteria and remarks: 

1. A collating sequence has utility in data processing 
codes containing alphabets; transmission codes do 
not require such a sequence. 

2. A collating sequence has no utility in a 4-bit set. 

A collating sequence has utility in 5- and 6-bit sets 
and it is desirable that the sequence correspond to the 
binary representations. 

4. If it is assumed that the i- and 8-bit sets contain upper 
and lower case forms of the same alphabet, it is im- 
possible to have the collating sequence match the 
binary representations, for the case distinction is of 
lesser significance than the distinction between charac- 
ters with different meanings. [7, 8] 

5. It is not necessary that, the full 4-bit set be in 1C 
contiguous positions in larger sets. It is only necessary 


that the lowest four bit positions form the dense, un- 
duplicated set. Other bit positions may vary. However, 
the digits 0-9 (10, 11) should be certainly be grouped 
contiguously in any set. 

6. Punctuation characters have natural delimiting 
functions and should thus collate low to both the 
alphabet and digits. These include, but are not limited 
to: 

blank .,/-:;'() (not in ranked order) 

7. Since period and hyphen are natural delimiters, they 
should be placed low to both alphabets and digits. 
However, they often serve as radix point and minus 
sign (which are not delimiters) in the 4-bit numeric 
set. There must, also be a character in this set to serve 
as a blank; this may or may not print in the 4-bit 
numeric mode. Therefore any characters of the 4-bit 
set which are delimiters should be in a different con- 
tiguous block than the digits, so they can serve the 
delimiting function in larger sets. There should be 
some regular transformation to append bits when 
expanding to larger sets. 

8. All expansion and contraction from and to the various 
set sizes shall be blind, without knowledge of the 
meaning of the character assigned to any bit repre- 
sentation, or of contextual adjacency (with the excep- 
tion of figure/letter shift control in going between 
5- and 6-bit sets). 

9. In all expansion and contraction, master space must 
remain all 0’s and delete must remain all l’s. escape 
shall always be the second highest code, one less than 
delete; thus all bits except the low order are 1. For 
paper tape usage, blank must be different from 
master space and therefore shall have all bits 0 except 
the low order. This guarantees that blank, as the 
primary delimiter, collates low to all other characters. 

It is also the complement of escape. 

10. All possible caution should be exercised ip alphabetic 
regions to provide maximum expansion for non- 
English alphabets (> 26 letters). 

The 8 bits are represented by B 7 through B„ , high to 
low order. The 6-bit transmission set will be developed 
first. Figure 1 shows a modified Fieldata pattern with B 5 
not yet assigned, reflecting criterion 9 only. B 6 = 0 for 
Fieldata, B 5 = 1 for U.K. 

It is now obvious that ls and fs should be opposite 
es and dl, not ms and blank, in order to maximize the ■ 
number of punctuation characters following blank. ‘ 
Since the decimal digits must have their binary repre- 
sentation equal to the binary value, they must be placed 
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ill the XL quadrant. This is shown in Figure 2. link feed 
(lf) and cahriaoe retuh.v (cr) are necessary for charac- 
ter-at-a-time 1 printers associated with existing communi- 
cation systems. As control signals, they are grouped with 
other control signals rather than with ms and blank, 
whiv' 1 are essentially informational. 

T : re is space for 20 special characters in the 6-bit set, 
bur pair of these must disappear in the 5-hit set. 

In conformity to most existing practice, the other sLx 
characters of the 4-bif set have been selected as: 

.— 4- (for self-delimiting data fields) 
b printing separator (may have graphic repre- 

sentation) 

j ( hgit grouping /most expendable for 

* indicator tor totals, etc. /British pence (10, 11) 

U.e two positions following the digit 0 are not usable 
for delimiters in the (i-bit set, since they will collate high. 
These six characters are assigned as shown in Figure 0. 
file pairs (.,) and ( F) area distance of two bits apart, 
toi easier error detection. -f- is used fully interchangeably 
with &, since it may also take the forms I &. / has 

been chosen as alternate for the blank in the 4-bit set, 
since it can serve very well as a space indicator, as 
00 («li 505. ,//2 1 . 
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Transmission Code 


Data Processing 
Code 

Fig. 5. The Proposed Standard Code 

1 I he common term in communications is “page” printer; 
low ever, printers which print an entire page at one time must 
Preempt this term. 


The basic set is achieved by changing the transform at 


®. 

B< = B 3 V (Bo A Bo 

4 he British set should have 10 and 11 immediately 
following 9. This is achieved by changing the transform 
at ®. 

B, = B 3 V Bo 

F igure 4 gives the special characters specified in existing 
systems. 

A I 1 oirniAN-eommercial substitution exists to overcome 
limited capacity of line printers. The correspondence is: 

« to = @ to ' % to ( □ to ) 

The Bell and who are you” functions are ignored here 
because they do not warrant individual characters. They 
are handled best by the escape mode. 

B 5 mayjiow be assigned specifically. 

X = 0, X = 1 yields a modified Fieldata which, 
unless transformed, has punctua- 
tion high to the alphabet. This is 
_ not logically consistent. 

X = 1 , X = 0 yields modified U.K. 

“ Uni vac, MH 

“ RCA 501 
“ 704 internal 

We will thus choose the latter. This choice also di- 
minishes the number of bits or punches in numeric data, 
which is most frequent to data processing. 

The specific proposal of Figure 5 implies either that: 

(a) the data processing code is internal, and the exclu- 
sive or mapping takes place at the interface on reading 
or writing externally on media such as tape or communica- 
tion lines, which utilize the transmission code, or 

(b) the data processing code is merely figurative and 
represents the effective collating sequence obtained by a 
simple comparison logic in the machine. 

The transmission code folds to the Baudot-like code of 
Figure 6. It retains all the special characters of present- 
day Teletype, plus the *. Although ? and 1, as effective 
delimiters, might well precede the alphabet in the data 
processing code (involving a swap with < and >), to do 
so would remove ? and ! from the 5-bit transmission 
code. If the transmission people agree, this change could 
be considered. ! 

As a 6-bit transmission code, ® to ® are available. 
These might be used either for additional control func- 
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tions or (preferably) for additional characters of foreign 
alphabets, fs and ls are also available, once the complete 
change is made from existing equipment. Fieldata would 
then use these characters as upper case (uc) and lower 
case (lc) respectively. This provides a representation of a 
7-bit code in 6-bit form, just as Baudot represents a 6-bit 
code in o-bit. form. Therefore the fs-uc and ls-lc corre- 
spondences are true, and either mnemonic might be used. 
Perhaps a new combination would be desirable, as fu 
(for Figure/Upper) and ll (for Letter/Lower). 

— @ % and □ are placed high in the data processing 
code, and it is assumed they will not be used in control 
keys. Figure 7 shows the dp code satisfying the last set of 
special characters of Figure 4, plus the Fortran trans- 
formation and a 48-character set. 

The proposed set has the special characters assigned 
for reasons other than matching correspondence between 


the digits and the characters associated with those digit 
typewriter keyboards. The reasons are: 

1. Some typewriters do not have keys for the digit 
(1) or even for the digit (0). 

2. There is no such thing as a standard typewriter 

board in the U.S. There is a proposed British Standi 
but the characters placed most uniformly, the left; 
right parentheses, are above 9 and 0 respectively... 
conforms with much practice in the U.S., but 0 mils 
placed in parallel to ms in accordance with our pre 
rules. . . %>, 

3. Transmission people sometimes desire the p 
theses over the 8 and 9 respectively, but this occurs 
in the Luebbert revision of Fieldata (not the ori 
Fieldata) and Ferranti computers. 

4. Users normally adjust automatically to any arraS 
ment of special characters after a day’s usage. Ti 
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5. Xcm-Kiiglish keyboards differ greatly in this plaee- 
inenf- It would be unfair to inflict one of the many English 
arrangements as an international standard. 

6. The Fnimt Ax-Commercial interchange is accom- 
plished in the proposed set by recognizing 

B, A B 4 A (B :i a B 5 ), 

and inverting B :l and B r , if this condition is true. Any 
other arrangement greatly complicates the logical hard- 
ware necessary in converting existing printers, probably a 
more expensive process than converting existing Teletype- 
writers. 

7. Any such correspondence will still require two modes 
of k -y board logic to generate codes. 

T 'tie transmission people could modify existing equip- 
meie with an ioxcli'sivk ok function (two relays), a com- 
pletely common and eollatable 0-bit code could exist, as 
shown in figure S, subject to the requirements for expan- 
sion to 7- and 8-bit, sets. 
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APPENDIX 

On the Relative Position of the Alphabet, Numbers 
and Special Characters in a Code Set Based upon 
I ransmission and Data Processing Characteristics 

Consider a 4-quadrant, arrangement of 16 states per 
quadrant in which Ql, Q2, Q3 and Q4 represent the octal 
codes 00-17, 20-37, 40-57 and 60-77. Consider also four 
t' lasses of information symbols which may be placed in 
these quadrants: 

D representing the 10 or 12 digits of the decimal or duo- 
decimal system 

^ representing the class of special characters 

A representing a section of the alphabet beginning with the 
letter A 

Z representing a section of the alphabet ending with the 
letter Z 


If each of these four classes of information is assumed to 
consist of up to 16 codes, they may be assigned to the 4 
quadrants in any of 24 combinations. We now consider 
these combinations in light of their desirability for data 
processing and data transmission. 

Transmission Considerations 

1. Concepts of master space and blank are distinct. The term 
“blank” refers to the element of information used to separate 
words on a printed page, master space occupies the zeroth 
position. 

2. The concept of erase or delete is represented by the iVth 
character. 

3. master space, blank and delete are concepts required in 
all alphabetic or alphanumeric sets. Further ms and dl 
always occupy the same relative position in each set. 

4. The (54-state code set is to be representable by a 32-state 
code using a shift mode. In this compressed representation the 
alphabet is to form one shift and the numbers and special 
characters the other. 

These criteria imply: 

01. A and Z cannot fold upon each other (4). 

02. Ql and Q4 cannot fold upon each other (1, 2, 3). 

03. master space must be in Ql. (1) 


Data Processing Considerations 

5. The digits are represented by their pure, natural binary 
equivalents. Since only four bit positions are necessary to 
represent up to 16 states, any additional bit position in a given 
set must contain the same pattern of l’s and 0’s for each digit. 

6. No symbol other than master space ranks lower than blank 
in the collating sequence. 

7. The alphabet is dense in collation. 

8. Certain field-separating symbols including blank must rank 
lower than the alphabet in collating. 

These criteria imply: 

04. D cannot be in the quadrant which contains or folds on 
master space. Otherwise 0 and master space would become 
identical in some code set. (5) 

05. Z cannot occupy Ql. (7) 

06. master space and blank must appear as adjacent characters 
in the same quadrant. Otherwise some symbols will be less 
than blank or ms will have a rank higher than blank. (6) 

07 . blank cannot be associated with A. Otherwise some special 
symbols would either be lower in collating than blank, or 
field-breaking symbols would be higher in collating than the 
alphabet which they are intended to separate. (6, 7) 

08. (7) and (8) immediately above imply that A cannot occupy 
Ql since ms must be located here. 

Application of Rules to the Combinations of D, A, 
Z, and S on Ql, Q2, Q3, and Q4 

The requirements 04, 08 and 05 remove from considera- 
tion the 18 combinations beginning with D, A and! Z 

respectively. In addition any combination in which A 

does not precede Z can result in a non-dense alphabet. 

{please turn to page 235) 
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TABLK l 

Denial** of the X ormal Function in Octal Corresponding to the 
Cumulative Area From .5 to 1.0 

\reu < Octal) N 

Scaled 2" 

ormal Deviate (Octal)j 
Scaled 2* 

Area (Octal) |N 
Scaled 2* 

ormal Deviate (Octal) j 

Scaled 2 s * ^ 

200 

00 00000 00000. 

00 00120 15457. 

00 00240 33630. I 
00 00300 53002. j 

300 

00 12625 32704. 

00 12772 42473. 

00 13140 41000. 

00 13307 32316. 


00 00500 73431. 

00 00021 10043. | 
000074143141. 

00 01001 73214. I 


00 13457 10144. 

00 13027 77435. 

00 14001 00730. 

00 14154 44377. 

210 

00 01202 20537. ! 
00 01322 00042. 

00 01443 32032. 

00 01504 03000. ! 

310 

00 14330 34471. 

00 14505 34031. 

00 14003 45573. 

00 15042 74314. 


00 01704 01253. 

00 02025 45515. i 
00 02140 40500. 

00 02207 42513. : 


00 15223 42707. 

00 15405 34403. 

00 15570 55071. 

00 15755 27037. 

•J20 

00 02410 54004. '. 
00 02531 75520. ! 
00 02053 30005. ' 
00 02774 73427. 

320 j 

j 

00 10143 40227. 

00 10333 12007. 

00 10524 33524. 

00 16717 27240. 


00 03110 50535. | 
00 03240 40077. 

00 03302 42443. i 
00 03504 00340. 

i 

00 17114 02241. 

00 17312 41757. 

00 17512 73747. 

00 17715 25671. 

230 

00 03027 12101. : 
00 03751 00400. j 
00 04074 44250. : 
00 04217 45711. ! 

330 

00 20121 05564 . 

00 20330 42335. 

00 20541 43207. 

00 20755 00012. 


00 04342 05010. | 
00 04400 24554. ; 
00 04012 03410. 

00 04730 02547. 


00 21173 00305. 

00 21413 55747. 

00 21037 22316. 

00 22005 70106. 

240 

00 05002 22020. 

00 05200 04474. 

00 05333 51003. 

00 05400 00415. 

340 

00 22317 52300. 

00 22554 65705. 

00 23015 50463. 

00 23202 21220. 


00 05000 13015. 

00 05733 73512. 

00 00002 01012. 

00 00210 34430. 


00 23533 00141. 

00 24010 07632. 

00 24271 74500. 

00 24560 64671. 

250 

j 00 00337 10705. 

1 00 00400 30775. 
00 00015 73071. 
00 00745 70150. 

350 

00 25055 11231. 

00 25357 26315. 

00 25667 74207. 

00 26207 36451. 


! 00 07070 10012. 
00 07227 00405. 
00 07300 10030. 
00 07511 72144. 


00 26536 46016. 

00 27076 02014. 

00 27446 51276. 

00 30031 33024. 

260 

00 07044 03407. 
00 07770 54100. 
i 00 10131 65174. 
00 10205 37400. 

360 

00 30427 40773. 

00 31042 23633. 

00 31473 35502. 

00 32144 76235. 


00 10421 54234. 
00 10550 34507. 
00 10713 01705. 
00 11051 55313. 


00 32641 23053. 

00 33363 41636. 

00 34137 46520. 

00 34752 51075. 

270 

00 11210 20025. 
00 11347 33317. 
00 11507 21011. 
00 11647 02131. 

370 

00 35633 37754. 

00 36573 54572. 

00 37631 00460. 

00 41010 33357. 


00 12011 00143. 
00 12152 74034. 
00 12315 51006. 
00 12401 10401. 


00 42354 42517. 

00 44204 72526. 

00 46534 51133. 

00 52437 23555. 


As can be seen, this is not the only method of applying 


The Fieldata arrangement A, Z, S, D is not acceptable. 


(the character in the first position of Q3) other than 
master space must have a rank less than blank. 

This examination of the possible combinations of S, D, 
A, Z merely indicates which arrangements should be 
further investigated with view of their expansion charac- 
teristics in sets of more than 6 bits. The analysis is in- 
tended to remove much of the confusion which has existed 
as to what combinations are possible and desirable for 
expansion and contraction. Thus, the arrangements 
S, A, Z, D and S, D, A, Z will be given further analysis 
and a choice between them made on the facility of their 
expansion and contraction characteristics. 

Folding Considerations 

At this point the methods of folding must be considered. 
The four quadrants may be folded in one of the two ways 
by the removal of either Bo or B4. 

Ql on Q2 and Q3 on Q4 
Q1 on Q3 and Q2 on Q4 

When the folding consideration is applied to the remaining 
combinations of S, A, D, Z we find, 


SDAZ 

SAZD 

SADZ 


Ql on Q2 
Note 1 
Note 2 
Note 3 


Ql on Q3 
Possible 
Note 4 
Note 1 


Note 1. The digits cannot fold on master space. 

Note 2. The transformation characteristic between 5 and 6 bits 
is dependent upon the combination being treated as 
well as the shift. 

Note 3. This is possible by treating Bi of the 6-bit representation 
as the mode bit. However, this leads to a non-dense 
alphabet. 

Note 4. Z cannot fold on master space. 


Conclusion 

From the consideration of data transmission and data 
processing criteria we are led to a code organization of 
S, D, A, Z. This organization, however, should not be 

considered as giving the collating sequence. ! 
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